A shift in climate
The Intergovernmental Panel on Climate Change has done much to alert politicians to the effects of
global warming. But to push climate change up the agenda, it will need to do the same for the public. 


Hoesung Lee has laid out a vision for his tenure as the fourth
chair of the Intergovernmental Panel on Climate Change
(IPCC). The South Korean economist says that he wants to
increase coordination between the working groups, work with more
scientists from developing countries, boost interaction with the
business and financial industries and expand the panel’s influence
by making its findings easier to digest. Above all, Lee says that he
wants to be remembered as the man who shifted the panel’s focus
towards solutions. Those are all worthy goals, but the organization
that assesses — and in some senses oversees — the world of climate
science is well placed to do a lot more.

The basic science underlying climate change has been firmly established,
and we now know much more about not only the threats posed
by a rapidly warming world, but also the options humanity has for
changing course. Even though the commitments that have been proffered
going into the United Nations climate summit in Paris at the
end of next month are generally tepid, the governments of the world
by and large recognize the problem and know that they need to act.
What else can the IPCC do? The answer is plenty, and Lee’s emphasis
on solutions is one piece of the puzzle.

Most of the world’s major emitters — rich, poor and in
between — have offered commitments and policies to move their
countries in the right direction. Without concerted action in the decades
to come, however, these commitments will fall well short of what
the best science suggests is needed to achieve the formal goal of limiting
the rise in global average temperatures to 2 °C. Political leaders are
leaving the hard work for later, and could well be committing future
generations to more warming than anybody wants to experience.

This is partly because of a lack of political leadership and of active
opposition by entrenched industrial interests, as environmentalists
argue. But it is also evidence of the vastness of the challenge. It is not
easy to transform the global economy and industrial base for a large
and growing population, much of which is still mired in poverty but
wants access to the modern conveniences that so many on the planet
take for granted. The world needs a full suite of technologies that are
not only cost-effective but also socially and politically viable. Here, the
IPCC has a particular part to play.

The weak commitments going into Paris are also evidence of a disconnect
between scientists, who think that the evidence speaks for
itself, and citizens and policymakers, who have a lot of other things
on their minds. The IPCC is looking at bringing science writers and
graphics experts on board in an effort to improve its reports. A linguistics
study published earlier this month showed that the IPCC
summaries for policymakers score low in terms of readability, and
recommends that key panel members receive science-communication
training (R. Barkemeyer et al. Nature Clim. Change http://doi.org/79f;
2015). All this makes sense, but communicating the science more
clearly is just the first, and a relatively minor, step.


The IPCC’s reports are aimed mainly at — and written in coordi- 
nation with — governments, yet politicians at the very highest levels 
are already talking about climate change. The unfortunate truth is 
that taking steps to combat climate change is way down the politi- 
cal agenda, and that makes more aggressive action difficult. The real 
challenge is to raise public awareness about the risks of inaction — as 
well as the benefits of action — and to identify policies that can pass 
the political litmus test. 

Here, as Lee himself has said, the IPCC has an important role. The
panel must generate and incorporate knowledge about how information
filters through society and about the kinds of policies that are most
likely to work. This is the domain of sociologists, psychologists,
anthropologists and political scientists, and they must be an integral
part of the IPCC’s sixth assessment.

The IPCC has had its controversies, including a glitch in its 2007
projection for Himalayan glacier melt and this year’s resignation of
former chairman Rajendra Pachauri, who faces — and denies — accusations of
sexual harassment. But the challenges that face the panel today are in
many ways a result of its success. Much ground has been covered; the
challenge now, for both researchers and the IPCC, is to adapt and to
identify research that will help policymakers to bridge the gap between
what they say they want to do and what they are actually doing.


After Asilomar 


Scientist-led conferences are no longer the best 
way to resolve debates on controversial research. 


In 1975, some 140 scientists met at the Asilomar resort on California’s
rocky Monterey Peninsula to discuss the nascent science of
mixing DNA from different organisms.

Until that point, researchers had deliberately not performed the
final steps of such experiments, owing to concerns about safety and
ethics. Over three days of discussions, the conference attendees agreed
to voluntary restrictions on recombinant-DNA research, and drafted a
document that listed the potential risks of such experiments and how
to carry out the work safely.

The meeting is seen as the first time that science had regulated
itself — effectively avoiding government intervention — and assuaged
public fears by addressing biosafety concerns head-on.

Today, no scientific controversy is complete without calls for an
‘Asilomar-like’ conference. Until such a conversation has taken place,
proponents say, researchers should not proceed with risky propositions.

Debates on artificial intelligence, autonomous weapons, geoengineering
and the use of gene-editing technology have all referred to
Asilomar as a useful model. (Geoengineers went so far as to meet in
Asilomar.) This month, a group of scholars, programmers, artists,
entrepreneurs and video-game developers published a Biosphere Code
to protect people and the planet from the negative impact of computer
algorithms — produced after Asilomar-like discussions in Stockholm.

But is Asilomar’s reputation deserved? The invitation-only conference
included a handful of journalists and policymakers, but did not cast a 
wide net outside the scientific community. And in hindsight, many of 
its safety precautions may have been overkill. As bioethicist Jonathan 
Moreno puts it: “Asilomar has become a bio-Woodstock in people’s 
memories, a golden age. People forget how muddy Woodstock was.” 
Modern science is muddier still: in a 2008 essay in Nature, even Asilo- 
mar organizer Paul Berg admitted that such a conference would be dif- 
ficult to convene today (P. Berg Nature 455, 290-291; 2008). In 1975, he 
and his colleagues had yielded to concerns from within their tight-knit 
community. They could afford to pause their research, having reason- 
able certainty that the technology would not advance in the meantime. 
But like everything else in the twenty-first century, science has 
become a global affair. An enormous number of researchers have 
almost unfettered access to information and increasingly easy-to-use 
tools. As a result, ‘synthetic’ organisms, enhanced influenza viruses
and genetically modified human embryos already exist, whether the 
world is ready for them or not. Even if they are destroyed, the instruc- 
tions to make them will inevitably make their way onto the Internet — 
a technology as pervasive and uncontrollable as any biological entity. 
Modern science is also less insular than that of the past, and any 
single Asilomar conference would probably be lost in the noise. New 
players have appeared over the past four decades, including a powerful 
biotechnology industry driven at least in part by profit; the most polar- 
ized US government in history, which can turn any new technology 
into a political weapon; and a mass of religious and activist groups that
have flexed their muscles to stop research on embryonic stem cells and
genetically modified organisms in their tracks. 

Each represents and interacts separately with the general public. And 
scientists who wish to self-regulate ignore public outcry at their peril: 
crowd-pleasing politicians passing knee-jerk regulations will hinder 
scientific progress more than any voluntary moratorium ever could, and 
their poor understanding might cause collateral damage to related fields. 

When controversy comes calling, rather than asking for an Asilomar
conference — which, after all, was closed to the public — scientists
should reach outwards. Discussions should extend beyond researchers
and ethicists to include, or at least broadcast to, the broader public.
Proactive engagement with the mass media is key: the most transparent
of webcasts is meaningless if only a rarified group of already-interested
individuals knows that the meeting is happening.

The most important thing is to communicate the risks and benefits 
of controversial research in a responsible and transparent way. For 
embryo editing, for instance, discussions should avoid unhelpful ref- 
erences to the genetically modified humans in the 1997 film Gattaca, or 
veiled aspersions on the ethical standards of non-Western researchers. 

A modern Asilomar might also take advantage of the wide range of 
expertise and techniques available today. Some advisory bodies, such 
as the US National Academies’ committee on research that involves 
enhancing influenza viruses, have had the forethought to include 
economists and futurists expert in drawing up realistic risk–benefit
analyses and scenarios. Other strategies could include a war-game 
approach similar to the 2001 Dark Winter exercise, in which US 
media, government officials, health experts and military groups simu- 
lated a bioterror attack to anticipate the problems that would arise. 

In 1975, the week after the Asilomar conference, actor Telly Savalas 
— star of the detective programme Kojak — topped the UK music 
charts with a spoken-word version of If by rock group Bread. The 
world has moved on since then; science must as well.


The worm returns 


The wiring diagram of the male nematode’s 
nervous system is only a beginning. 


When scientists seemed to have completed the map of the nervous
system of a tiny male worm in 2012 (T. A. Jarrell et al.
Science 337, 437-444; 2012), some researchers were already
questioning whether the whole effort, originating some 40 years before, 
was truly worth it. The construction of the wiring diagram for the nerv- 
ous system of the male of the nematode species Caenorhabditis elegans 
built on the wiring diagram for the hermaphrodite, established more 
than 25 years earlier, and required painstaking tracing of how the male 
worm’s extra neurons connected to each other.

Stephen Hawking was talking metaphorically when he famously 
wrote that to unravel the laws of nature would be to know the mind of 
God. The C. elegans project was quite literal: as Sydney Brenner, the 
originator of the project, jokingly entitled the manuscript of a landmark
1984 paper, the scientists really did want to know ‘the mind of
the worm’. And in doing so, they argued, they could learn more about
how brains create behaviour in higher organisms all the way up to 
humans. 

Can we really know the mind of a worm? Three years on we have an 
answer of sorts: possibly. In fact, it turns out that we did not even find 
all the neurons that comprise the male worm’s mind. For on page 385, 
researchers including two of the 2012 team publish an appendix to 
the wiring diagram of the male C. elegans. As well as the 383 neurons
already identified, they describe the discovery of neurons number 384 
and 385, which they found in the worm’s head. 

There is much to admire about the new work, not least that the 
researchers chose to call the new cells — mystery cells found in the 
male worms — MCMs. No metaphor there either. It stands for mystery 
cells of the male. 

What's on a male nematode’s MCMs? Not so much of a mystery as it
turns out: sex. The new neurons have an old role, and help the worms 
to learn to prioritize the search for a mate over the need for food. When 
these neurons are put out of action, the male worms never discover 
the facts of life. The findings offer much more than a completion of 
the neural map of the male worm. In most organisms, sex-specific 
differences in behaviour extend to cognitive-like processes such as 
learning, which can aid reproductive success, but the underlying neu- 
ral mechanisms are mostly unclear. 

The discovery of the MCMs, and the subsequent experiments with 
them, link developmental and anatomical sex differences in high- 
order processing areas of the brain to sex-specific behaviour during 
learning. In doing so, they help to shed light on the neural basis of 
sex differences in behaviour. And they show that these neurons arise 
upon sexual maturation from specialized cells called glia — unlike 
other neurons in C. elegans and other invertebrates, which arise from 
epithelial or undifferentiated blast cells. 

Was the effort to trace out the connections 
between the male nematode’s neurons worth it? 
Like all good maps, the wiring diagram of the 
C. elegans is best viewed as a starting point. The 
final destination is sure to surprise us.
Consider all the evidence
on alternative therapies

Investigate and incorporate the mechanisms of complementary medicine
instead of rejecting it outright, says Jo Marchant.

[…], “a joke”, and “exactly the sort of thing the NHS should not
be doing!” are a few of the Twitter responses to last week’s news
that Britain’s Princess Alexandra Hospital NHS Trust wants to hire
a reiki therapist for a hospital in Epping. On a salary of up to £22,236
(US$34,000) a year, the appointed person “will provide Reiki/Spiritual
healing to patients to enable them to cope with the emotional, physical
and spiritual issues of dealing with their cancer journey”.

Critics of the advert — and there are many — advocate instead what
they call “evidence-based” approaches to health care. These critics
should look again at the evidence — because it shows that to dismiss
the benefits of alternative therapies is simplistic and misguided.

Let’s be clear, I don’t buy into the pseudoscientific claims of reiki
and spiritual healers. There is no evidence that they can tap into and
manipulate human ‘energy fields’ to clear blockages
and heal the body. Like many alternative
therapies, these practices perform no better than
placebos in clinical trials.

But that does not mean that such treatments
have no distinct therapeutic value. To dismiss
people’s complex psychological and physiological
reactions to serious illness — and how it is treated
— as mere placebo effects is not helpful.

Neuroscience studies show that placebo effects
can trigger significant physiological responses
that are often identical to those created by drugs,
ranging from the release of dopamine in the
brains of people with Parkinson’s disease to a
rush of endorphins for those in pain.

The standard ‘evidence-based’ argument is
that this is irrelevant. Even if alternative therapies
induce a biological response, sceptics argue, patients are still better off
receiving trial-proven conventional treatments, because then they benefit
from both a placebo effect and the active effect of the drug.

This logic misunderstands the nature of placebo effects. Not all
placebos are the same, and alternative therapies can sometimes trigger
larger responses than conventional ones do. For example, in one
trial, fake acupuncture relieved pain more effectively than a fake pill
(T. J. Kaptchuk et al. Br. Med. J. 332, 391-397; 2006); in another, it
relieved symptoms of irritable bowel syndrome with fewer side effects
than available drugs (T. J. Kaptchuk et al. Br. Med. J. 336, 999-1007;
2008). It is true that if a therapy cannot beat a fake version of itself in
trials, it is not working as the therapist claims. But if it triggers a big
enough placebo effect, it might still be the best treatment available.

If drugs are effective and placebo responses small, this does not
matter much. But people tend to turn to alternative medicine for
subjective, stress-related conditions such as chronic pain, depression,
nausea and fatigue (all problems that can affect cancer patients in
treatment). Drugs for these conditions have significant downsides,
such as unpleasant side effects and addiction, and placebo responses
often account for most of the effect of the drug. So it becomes
plausible that compared to popping a pill, a patient might get more
relief — and fewer side effects — from an hour with a sympathetic
therapist.

The benefits of therapies such as reiki and acupuncture go beyond 
what we normally think of as placebo effects, however. Alternative 
therapists do not get results just because they are particularly good at 
fooling people into thinking that they will get better. Many elements 
of the care they provide — from talking to touch — seem to have the 
power to relieve symptoms and even influence physical outcomes. 
These elements do not show up when therapies are compared against 
sham treatments, because they are present in both arms of a trial.

Such benefits can be indirect. For example, 
tackling patients’ anxiety during invasive proce- 
dures such as keyhole surgery can reduce the risk 
of dangerous fluctuations in heart rate. This results 
not only from the direct effects on physiology, but 
also probably from patients needing lower doses 
of sedatives and painkillers. 

Conventional medicine, with its squeezed 
appointment times and overworked staff, often 
struggles to provide such human aspects of care. 
One answer is to hire alternative therapists. 

This ensures that such therapies are regulated, 
and that patients also get the conventional treat- 
ment they need. Such ‘integrative medicine’ is now 
offered by dozens of major US academic medi- 
cal institutes. The Stanford Center for Integrative 
Medicine in California offers acupuncture to help 
with chemotherapy side effects. If this helps patients to complete a con- 
ventional treatment by making those symptoms bearable, one therapist 
there told me, it might improve survival rates, too. 

Critics say that this is dangerous quackery. Endorsing therapies that 
incorporate unscientific principles such as auras and energy fields 
encourages magical thinking, they argue, and undermines faith in 
conventional drugs and vaccines. That is a legitimate concern, but 
dismissing alternative approaches is not evidence-based either, and 
leaves patients in need. 

Instead of rejecting such approaches wholesale, let’s learn from 
them. That means going beyond the simplistic practice of jettisoning 
anything that cannot beat placebo. We must tease out the real active 
ingredients of these therapies — things such as ritual, mental imagery, 
empathy, care and hope — so that we can learn how they work and 
find ways to incorporate them into patient care.


Jo Marchant is a science journalist and author of Cure: A Journey 
into the Science of Mind Over Body, to be published in January 2016. 
e-mail: jomarchant26@yahoo.co.uk 
RESEARCH HIGHLIGHTS


Selections from the
scientific literature


IMMUNOLOGY 


Infections ‘scar’ 
immune system 


After the body has cleared a 
gut bacterial infection, some 
intestinal tissues have long- 
lasting weakened immunity — 
partly because of gut microbes. 

Infection can lead to chronic 
inflammatory disorders. To 
find possible mechanisms, 
Yasmine Belkaid at the 
National Institute of Allergy 
and Infectious Diseases in 
Bethesda, Maryland, and her 
colleagues infected mice with a
foodborne pathogen, Yersinia 
pseudotuberculosis, and then 
monitored them for up to six 
months. They found a variety 
of changes that weaken the 
immune system in gut tissues. 
For example, immune cells 
called dendritic cells were 
diverted into fat tissue away 
from lymph nodes, where they 
would normally activate the 
immune response. Signals from 
gut microbes also seemed to 
maintain inflammation. 

The results show how 
frequent infections could lead 
to chronic disease later in life. 
Cell 163, 354-366 (2015) 


CHEMISTRY


Cheap absorber
for solar cells


An iron-based chemical can
absorb and convert light into
electrons with 92% efficiency
— making it a promising
material for solar cells.

Light-harvesting ‘sensitizers’
in solar cells are typically made
of rare elements, which are
expensive to scale up. To find
a cheaper alternative, Kenneth
Wärnmark at Lund University
in Sweden and his colleagues
engineered an iron-based
sensitizer that captures photons
and transfers their energy to
electrons in a similar way to
those based on rare elements.
The sensitizer converts photons
with 92% efficiency: 12%
better than the previous best
sensitizer based on iron.

Some electrons quickly
combined with positive
charges, limiting the effective
current generated. Still,
the authors say that using
abundant materials such as iron
as a sensitizer for photovoltaics
opens up possibilities for
low-cost solar energy.
Nature Chem. http://dx.doi.org/10.1038/nchem.2365 (2015)


METEOROLOGY


Winged weather watchers


Soaring birds equipped with sensors that track their location
could be used to estimate weather variables such as wind speed.

Jelle Treep at the University of Amsterdam and his team
analysed Global Positioning System (GPS) data from four
griffon vultures (Gyps fulvus; pictured) as the birds flew across
the Grand Causses region of southern France. By tracking the
birds’ in-flight location at three-second intervals and using
knowledge of airborne bird movements, the team estimated
wind speed and direction and vertical air movement. These
GPS estimates agreed with ground data at three local weather
stations and were consistent with meteorological theories.

As GPS equipment becomes smaller, lighter and able to
collect higher-resolution data, bird-borne trackers could
become an important tool for meteorological surveys in
remote areas, the authors say.
Bull. Am. Meteorol. Soc. http://doi.org/768 (2015)


First ancient 
African genome 


A 4,500-year-old human
skeleton from a cave in
Ethiopia has produced Africa’s
first ancient genome sequence.

Marcos Gallego Llorente at 
the University of Cambridge, 
UK, and his colleagues 
sequenced genomic DNA from 
one of the bones and compared 
it with sequences from modern 
Africans and Eurasians, as well 
as ancient Europeans. They 
conclude that the ancestors of 
modern Ethiopian highlanders 
were related to early farmers 
who moved into Europe 
from western Eurasia around 
9,000 years ago. Descendants 
of these people later moved 
back to Africa around 
3,000 years ago. 

Most Africans today have 
4–7% Eurasian ancestry
because of this migration, 
suggesting that this event was 
larger and more significant 
than was thought. 

Science http://doi.org/78d (2015) 


NEUROSCIENCE


People identified
from brain activity


A map of connections between
brain regions that are active
during mental activity can be
used as a unique, reproducible
‘fingerprint’ to identify
individuals.

Emily Finn of Yale University
in New Haven, Connecticut,
and her colleagues studied data
from the Human Connectome
Project, which is mapping all
of the structural and functional
connections in the brain. They
looked at 126 people whose
brains were scanned while they
were resting or doing certain
tasks. By analysing patterns
of neural connectivity, the
team identified subjects with a
success rate of more than 90%
when comparing rest scans,
and with 54-87% success
when comparing brain activity
during tasks. The most useful
networks for identifying people
were those in certain regions of
the cerebral cortex that control
attention, memory and other
cognitive functions.

The results provide a
foundation for future work
to link functional brain
connections with individual
behaviours, the authors say.
Nature Neurosci. http://dx.doi.org/10.1038/nn.4135 (2015)


How elephants
dodge cancer


Elephants have extra copies of
a gene that fights tumour cells,
which could explain why they
rarely develop cancer.

Joshua Schiffman at the
University of Utah in Salt Lake
City and his colleagues studied
elephant white blood cells and
found that they have 20 copies
of a tumour-suppressor gene
called TP53 in their genome —
humans and other mammals
have only one. The cells also
underwent TP53-mediated
apoptosis — programmed cell
death — more frequently than
human cells do when exposed
to DNA-damaging radiation.
This suggests that elephant cells
kill themselves to avoid the risk
of uncontrolled growth.

In a separate study, Vincent
Lynch at the University of
Chicago in Illinois and his
co-workers report similar
results. They also discovered
more than a dozen TP53
copies in two extinct species of
mammoth, but just one copy
in manatees and in small furry
mammals called hyraxes —
both close living relatives of
elephants. The extra copies may
have evolved as the animals in
the elephant lineage expanded
in size, says the team.

J. Am. Med. Assoc. http://doi.org/772 (2015); Preprint at
bioRxiv http://doi.org/773 (2015)


Early human with a
familiar handshake


A recently discovered early
human species probably
walked upright and wielded
tools, but also took to the trees.

Last month, researchers
reported the discovery of
fossil bones from at least
15 individuals of a species
they named Homo naledi.
A team led by Tracy Kivell
at the University of Kent in
Canterbury, UK, has analysed
nearly 150 hand bones from
the find, including a complete
right hand (pictured, left).
The hands resemble those of
Homo sapiens, Neanderthals
and other regular tool-users,
although the long, curved
fingers suggest that H. naledi
was comfortable in trees.

In a separate study, William
Harcourt Smith at the City
University of New York and
Jeremy DeSilva at Dartmouth
College in Hanover, New
Hampshire, looked at 107 foot
bones, including a nearly
complete right foot (pictured,
right), and concluded that
H. naledi strode upright.
However, the feet still had
some primitive features:
certain toe bones were more
curved than are those of
modern humans.
Nature Commun. 6, 8431; 6, 8432 (2015)


Corals cope with
pH-altered waters


Some corals seem to be
resilient to ocean acidification.

As carbon dioxide
emissions rise, ocean waters
are absorbing more of the gas
and becoming less alkaline,
threatening the ability of
corals and other marine
organisms to make skeletons
and shells. Lucy Georgiou
at the University of Western
Australia in Perth and her
colleagues exposed colonies
of Porites cylindrica coral
on Australia’s Great Barrier
Reef to flumes of modified
sea water. This lowered the
ambient pH around the
animals so that it was similar to
conditions that are predicted
for oceans at the end of the
century. After six months,
the researchers found no
difference in the growth rate of
the corals’ skeletons between
controls and those living in
lower pH conditions.

The corals naturally produce
a fluid that bathes the growing
parts of their skeletons, and
the team found that the fluid
had a higher pH than the reef
waters for all the corals in the
experiment. This suggests that
some corals can regulate their
internal pH to tolerate a certain
level of ocean acidification, the
authors say.
Proc. Natl Acad. Sci. USA
http://doi.org/77b (2015)


NEUROSCIENCE


Computer model
of rat-brain part


A supercomputer has
simulated the activity of
31,000 virtual brain cells
(pictured) in a section of rat
brain the size of a grain of sand.

Henry Markram at the Swiss
Federal Institute of Technology
in Lausanne and his team
built their model based on
experimental measurements
of rat brain slices. The
simulation represents roughly
37 million synapses, or
neuronal connections, in the
brain region that receives
sensory information from the
whiskers and other parts of
the body. Using the model, the
team simulated rat whisker
movement and saw similar
neuronal responses to those
observed in rat experiments.

The model could be
manipulated in ways that are
difficult to do experimentally,
providing insights into how
individual cells contribute
to the functions of neuronal
networks, the authors say.
Cell 163, 456-492 (2015)


SOCIAL SELECTION


Popular topics
on social media


Nobel prizes prompt surprise online


This year’s Nobel prizewinners seemed to surprise many
researchers, judging by their reactions on social media. The
chemistry prize recognized discoveries in DNA repair, yet
was not awarded to the scientists who won the prestigious
Lasker prize earlier this year for research in a similar area.
Many speculated that the physics prize would go to a woman
for the first time in more than 50 years, but it went to two men
for their work on neutrinos instead. One of the scientists who
shared the Nobel Prize in Physiology or Medicine, Chinese
pharmacologist Youyou Tu, discovered a malaria medicine
called artemisinin after studying traditional Chinese medicine
texts. As chemist Ashutosh Jogalekar wrote on his blog
(go.nature.com/loeard): “The story of artemisinin clearly indicates
that we need to pay much more attention to forgotten examples from
traditional Asian medicine and subject them to scrutiny.”
CORRECTION

The print version of the 
Research Highlight ‘Corals 
cope with acidified waters’ 
(Nature 526, 296-297; 
2015) incorrectly stated that 
ocean water is being acidified 
when in fact it is becoming 
less alkaline; the online title 
was changed to reflect that. 

It also said coral-made fluid
was less acidic than reef
waters; in fact, the fluid had
a higher pH. And it said that
some corals can control the 
pH of surroundings, whereas 
they control their internal pH. 
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SEVEN DAYS 


EVENTS 


Gene patent invalid 
The High Court of Australia 
has ruled a key gene patent 
invalid, casting doubt on the 
status of other gene patents 
worldwide. On 7 October, the 
court determined that isolated 
DNA corresponding toa 
cancer-associated gene called 
BRCAI was not “a manner 

of manufacture’, and thus 
could not be patented under 
Australian law. The decision 
puts an end to a five-year legal 
battle. The US Supreme Court 
ruleda similar patent invalid 
in 2013. 


Start-up lab 

The venture-capital firm 

Y Combinator (YC) in 
Mountain View, California, 

is establishing a non-profit 
arm devoted to basic research 
in fields ranging from 
information technology to 
biology. YC Research will 
employ its own scientists and 
will initially consist of just one 
lab. YC president Sam Altman 
says that he is contributing 
US$10 million of his own 
money to the enterprise, 

but that most details are still 
confidential. 


Orca breeding ban
The California Coastal
Commission has banned
the breeding of killer whales
(Orcinus orca) at the SeaWorld
theme park in San Diego.
SeaWorld was granted
permission to expand its orca
tanks in the park, but said
that it was “disappointed” by
the commission’s conditions,
which company president
Joel Manby said would be
“inhumane” in depriving
the animals of “the natural
and fundamental right to
reproduce”. SeaWorld has
been increasingly targeted
by campaigners who say
that keeping killer whales in
captivity is cruel; the company
denies these claims. The ruling
does not affect SeaWorld sites
outside California.


The news in brief


Sea warming triggers coral bleaching


The US National Oceanic and
Atmospheric Administration (NOAA) stated on
8 October that Earth’s oceans are experiencing a
mass coral-reef bleaching (pictured here in fire
coral in Bermuda). Sea surface temperatures
have risen enough to bleach reefs in three
major coral-containing basins — the Pacific,
Indian and Atlantic oceans. It is the third global
bleaching event in recorded history, after those
in 1998 and 2010; as in the previous events, the
El Niño weather pattern is helping to keep ocean
temperatures high. By the end of 2015, 38% of
the world’s coral reefs could be affected, NOAA
says. See go.nature.com/1h3bmx for more.


AWARDS


DNA-repair Nobel
The 2015 Nobel Prize in
Chemistry was awarded to
three scientists who mapped
how cells repair damaged DNA.
Tomas Lindahl, Paul Modrich
and Aziz Sancar shared the
prize, announced on 7 October.
Each discovered a different
molecular process. Lindahl
described how enzymes seek,
cut out and patch up sections of
damaged DNA, a mechanism
called base-excision repair.
Modrich worked on mismatch
repair, which sorts out errors
that are introduced when
DNA is copied. And Sancar
contributed research on the
nucleotide-excision repair
system, which explains how
cells use enzymes to repair
damage caused by ultraviolet
light. See page 307 for more.


Cell lines unchecked
More than half of biomedical
researchers do not authenticate
the cell lines that they use in
their experiments, a survey
published on 12 October
shows (L. P. Freedman et al.
BioTechniques 59, 189-192;
2015). Of the 446 scientists
surveyed, 52% said that they
do not do checks for species,
tissue type and sex, with many
citing cost and time constraints
as the reason. Concern about
reproducibility of results and
wasted research funds because
of misidentified cells led the
non-profit Global Biological
Standards Institute in
Washington DC to undertake
the survey. See go.nature.com/ddo2bz
for more.


Telescope start 


Construction began on 

9 October of a 23-metre 
telescope at Roque de los 
Muchachos Observatory 

on La Palma, in Spain's 
Canary Islands. The 
instrument is destined 

to be part of the world’s 
largest γ-ray observatory,

the Cherenkov Telescope 
Array, an international 
project expected to cost more 
than US$300 million, with 

its Southern Hemisphere 
counterpart in Paranal, 

Chile. The dish will detect 
faint flashes of Cherenkov 
radiation — blue light emitted 
by the showers of electrons that 
γ-ray photons unleash when
they hit the atmosphere. The 
telescope is expected to start 
operating by late 2017. 


Community lasers 
A powerful laser facility 

near Prague is due to open 
officially on 19 October. The 
€250-million (US$284-million) 
centre in the Czech Republic 
marks the first installation of 
the three-pronged Extreme 
Light Infrastructure (ELI), an 
experiment by the European 
Union to build large research 
facilities in countries that could 
not ordinarily afford them (see 
Nature 500, 264-265; 2013). 
The ELI will largely be paid for 
by European structural funds, 
which typically finance civic 
projects such as road repair 
and waste clean-up. It is set to 
include two other laser facilities 
in Hungary and Romania. 


Richard Heck dies


Nobel-prizewinning chemist
Richard Heck (pictured) died
on 9 October, aged 84. Heck
was a professor emeritus at
the University of Delaware in
Newark, and lived in Manila.
He shared the 2010 Nobel
Prize in Chemistry with Ei-ichi
Negishi and Akira Suzuki for
work he did in the 1960s and
1970s on reactions that link
two carbon atoms together
using a palladium catalyst.
His palladium cross-coupling
reaction, known as the Heck
reaction, opened up a way
for chemists in fields from
pharmaceuticals to electronics
to make myriad molecules
more easily.


Sexual harassment
Geoff Marcy, an astronomer
at the University of California,
Berkeley, renowned for his
work on exoplanets, has been
found to have violated campus
sexual-harassment policy.
BuzzFeed News revealed
the findings, which were
confirmed to Nature by a
university spokesperson, on
9 October. The investigation
was triggered by formal
complaints, and found that
Marcy had violated policies
in a number of incidents
involving students between
2001 and 2010. Marcy will face
consequences — “sanctions
that could include suspension
or dismissal”, according to
the university — only if he
continues to harass students.
Marcy apologized in an
open letter to the astronomy
community. On 12 October
members of the university's
astronomy department
released a statement advocating
that Marcy be removed from
the faculty. See go.nature.com/wqv2ng
for more.


BUSINESS


CRISPR alliance
Genome-editing company
Caribou Biosciences of
Berkeley, California, and
industrial chemicals giant
DuPont of Wilmington,
Delaware, announced on
8 October that they would join
forces to exploit CRISPR/Cas9,
a system used to make targeted
changes to genomes. DuPont
will gain exclusive rights to use
Caribou’s patented CRISPR/Cas9
technology in certain
crops, and plans to bring
genome-edited crops to market
within the next decade.


Heart drug halted
Drug company Eli Lilly
announced on 12 October that
it was prematurely halting the
phase III trial of its heart-disease
drug evacetrapib.
The drug, developed to
treat people with a high risk
of their arteries hardening
and narrowing, is part of a
family of cholesteryl ester
transfer protein inhibitors,
which increase fat-removing
particles called high-density
lipoproteins. The company
stopped the trial after a review
found that evacetrapib was
ineffective; there were no safety
concerns about the drug. The
news triggered an 8% drop in
Lilly’s share price on the day of
the announcement.


Toxic water
A US federal jury has found
industrial chemicals giant
DuPont in Wilmington,
Delaware, liable for
contaminating drinking
water with C-8, an ingredient
used in the manufacture of
Teflon. Carla Bartlett, who
developed kidney cancer
after drinking the water, was
awarded US$1.6 million in
damages. Her lawsuit — the
first of some 3,500 similar ones
to go to trial — claimed that
the water caused the cancer.
The 7 October ruling found no
malice on the part of DuPont,
which used the chemical at
its plant in Parkersburg, West
Virginia. A DuPont spin-off
company, Wilmington-based
Chemours, will bear the
liability.


14-16 OCTOBER
The European Food
Safety Authority holds
its second scientific
meeting as part of the
World Expo 2015 in
Milan, Italy.
go.nature.com/zbihwt


14-17 OCTOBER
Dallas, Texas, hosts the
Society of Vertebrate
Paleontology’s 75th
anniversary meeting.
go.nature.com/hmmzkm


20-22 OCTOBER
The 11th World
Conference on Bioethics,
Medical Ethics and
Medical Law takes place
in Turin, Italy.
go.nature.com/ammaum


TREND WATCH


The summary findings of
the Intergovernmental Panel
on Climate Change (IPCC)
are becoming increasingly
unreadable, a study shows
(R. Barkemeyer et al. Nature
Clim. Change http://doi.org/79f;
2015). The IPCC’s first assessment
report, from 1990, scored highest
in the Flesch reading ease test,
which analyses word complexity
and sentence structure. Its 2014
report scored lowest. By contrast,
news stories about the IPCC
are becoming easier to read. See
go.nature.com/4ithkr for more.


COMPLEX CLIMATE REPORTS
The IPCC’s summaries for policymakers (SPMs) are getting harder to
read, although media stories about the reports are written more clearly.
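As a rough illustration of the Flesch reading ease test cited in the Trend Watch item, the sketch below applies the standard published formula (206.835 minus 1.015 times words per sentence, minus 84.6 times syllables per word). The syllable counter is a crude vowel-group heuristic and the sample sentences are invented for this example; neither comes from the study, whose exact procedure is not described here.

```python
import re


def count_syllables(word: str) -> int:
    """Estimate syllables by counting vowel groups (a heuristic, not a dictionary lookup)."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    # Treat a trailing 'e' as silent ("ease" -> 1), except in "-le" endings ("table" -> 2).
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def flesch_reading_ease(text: str) -> float:
    """Standard Flesch reading ease score: higher means easier to read."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


# Hypothetical sample texts, written only for this illustration.
simple = "The cat sat on the mat. It was warm."
dense = ("Anthropogenic greenhouse gas emissions unequivocally influence "
         "contemporary atmospheric composition.")

print(round(flesch_reading_ease(simple), 2))
print(flesch_reading_ease(simple) > flesch_reading_ease(dense))
```

Short sentences built from one-syllable words score high; a single long sentence of polysyllabic terms scores far lower, which is the kind of contrast the study reports between news stories and the IPCC summaries.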
The challenges include mapping the diversity of cells in a tumour, such as this breast-cancer cell.


Giant charity lays out grand 
challenges of cancer 


Cancer Research UK invests £100 million in fashionable way to distribute cash. 


BY HEIDI LEDFORD 


How can you distinguish between deadly
cancers that need treatment and benign
cancers that do not? Can unknown
carcinogens be identified by the molecular
fingerprints that they leave in tumours? Could
vaccines be used to prevent more cancers?


The world’s largest cancer charity hopes to 
find out. On 12 October, Cancer Research UK 
(CRUK) in London announced its intention 
to dedicate at least £100 million (US$153 million)
over 5 years to funding research teams
to take on these and four other ‘grand challenges’.
The targets were identified by a panel
of independent international researchers
convened by the charity, as well as input
from the wider community. Teams eligible for
funding can have members from anywhere 
in the world, but must include a UK-based 
contingent. 

The charity will continue to fund projects 
proposed by investigators. But its new grand-challenge
approach is in keeping with the
times, says Nic Jones, the charity’s chief scientist.
“There’s a cultural change happening
within biological sciences,” he says. “The idea
is to be bold and take risks, and do things that
might not work and might take a long time.”

That idea is sweeping the globe: other
prominent efforts have issued ‘grand challenges’
that have tasked researchers with,
among other things, devising ways to track the
rise of antibiotic resistance and keeping tabs
on all asteroids that pose a threat to human
populations.

The movement has its roots in a series of
23 challenges issued by mathematician David
Hilbert in 1900; these are credited with setting
the course for twentieth-century mathematics.
The Bill & Melinda Gates Foundation in Seattle,
Washington, resurrected the idea in 2003
with its vaunted Grand Challenges in Global
Health programme, dedicating US$450 million
to achieve 14 broad goals that included
creating new vaccines and improving nutrition.
Since then, the foundation has awarded
1,752 grants in 81 countries under its
grand-challenges banner.


GRAND CHALLENGES


Things to know about cancer


Cancer Research UK intends to spend
£100 million (US$153 million) in pursuit
of seven goals that could spur advances in
preventing and treating the disease. They
are:

● Develop vaccines to prevent non-viral
cancers. This could stop a tumour
early, before it becomes well established,
metastasizes or begins to suppress the
immune system.

● Eradicate EBV-induced cancers from
the world. Epstein-Barr virus causes
200,000 cancers per year. Vaccines or
drugs that target this virus could save
many lives.

● Discover how unusual patterns of
mutation are induced by different
cancer-causing events. Sometimes a carcinogen
will produce a distinct pattern of DNA
mutations inside cells. Such patterns could
reveal previously unknown causes of cancer.

● Distinguish between lethal cancers that
need treating, and non-lethal cancers that
don’t. Catching cancer early is important
in defeating the disease, but over-diagnosis
leads to unnecessary treatments — some
of which can be life-altering, or have
dangerous side effects.

● Find a way of mapping tumours
at the molecular and cellular level.
Tumours are made of a genetically
diverse mixture of cells. Understanding
what cells are present and what they are
doing could enable physicians to predict
how a tumour will respond to a given
therapy.

● Develop innovative approaches to
target the cancer super-controller MYC.
The protein encoded by the MYC gene is
often considered undruggable, but new
approaches could change that.

● Deliver biologically active
macromolecules to any and all cells
in the body. Large molecules such as
proteins and RNA can be remarkably
specific drugs — but getting them into
cells is a challenge that has dogged the
field. H.L.

The model has been taken up around the 
world by governments, charities and universities
looking to tackle big societal problems
— and get a little publicity in the process.
“The term has become sexy,” says Abdallah 
Daar, a public-health specialist at the Uni- 
versity of Toronto in Canada who helped to 
craft the Gates Foundation challenges. “It has 
this element of bigness and difficulty and gets 
attention. And when people win, there’s a big 
announcement.” 


WAYS FORWARD 
Daar notes that the CRUK programme 
stands out for its focus on basic research. 
Although each of the challenges could even- 
tually contribute to clinical innovations, 
most of them focus on laying the ground- 
work for therapeutic advances. 

One challenge, for example, aims to develop
ways to map the cellular make-up of a tumour
(see ‘Things to know about cancer’). DNA
sequencing studies have shown that the popu- 
lation of cells in a tumour is genetically diverse 
— and the genes lurking in those cells could 
determine whether the cancer will become 
resistant to specific drugs. Understanding 
the cellular topography could help physicians 
select the right treatment. 

Another grand challenge asks researchers to 
find a way to deliver large molecules — such 
as certain proteins or RNAs — to any cell in 
the body. Although some large molecules, for 
example antibody therapies, are already used 
to treat cancer, they tend to operate outside 
the cell. “We know how to introduce these 
molecules in cell culture,” says Tyler Jacks,
director of the Koch Institute for Integrative 
Cancer Research at the Massachusetts Insti- 
tute of Technology in Cambridge. “If we could 
only do that in the body, we might be able to 
open up the door to all sorts of new types of 
therapy.” 


THE UNDRUGGABLE 

Other challenges are more specific: one, to 
eradicate cancers linked to the common 
Epstein-Barr virus, casts a spotlight on a 
cause of cancer that is particularly impor- 
tant in developing nations. And the charity’s 
team of advisers has challenged researchers to 
design a drug that will block a cancer-associ- 
ated protein called MYC. That is no simple 
task: researchers have been trying to do it for 
decades without success, and MYC is typically 
classified as an ‘undruggable’ target. But the 
hope, says Jones, is that methods uncovered 
in the attempt could also be applied to other 
elusive targets. 

CRUK intends to issue new challenges each 
year, and Jones hopes that suggestions for the 
next round will come from the community. 
Crafting a tractable yet inspiring challenge is 
an art, says Richard Klausner, chief medical 
officer of biotechnology company Illumina 
in San Diego, California, and chair of the 
CRUK challenge-selection committee. A few 
of Hilbert’s mathematical challenges remain 
unsolved more than a century after they were 
proposed; to avoid a similar outcome, Klausner 
says, the committee tried to avoid setting unat- 
tainable goals. 

“You want it to be ambitious enough, but not
crazy,” he says. “They need to be at the edge of
doable — but still doable.”
The inner ear’s petrous bone is a rich source of DNA in archaeological specimens.


PALAEOANTHROPOLOGY 


Hot climes yield 
ancient DNA 


After years of frustration, researchers are getting genetic 
material from old bones in warm places. 


BY EWEN CALLAWAY 


The quest to glean DNA from ancient
humans started in hot climes. In 1985,
a paper in Nature (S. Pääbo Nature 314,
644-645; 1985) reported DNA sequences
from an ancient Egyptian mummy — the first
time anyone claimed to have isolated genetic
material from a long-dead human.

But geneticist Svante Pääbo at the Max
Planck Institute for Evolutionary Anthro- 
pology in Leipzig, Germany, later real- 
ized that the sequences were likely to be 
the result of contamination by modern 
DNA — possibly his own. Knowing that heat 
speeds up the decay of DNA, he and others 
turned their focus to remains from cooler 
climates, where DNA was more likely to persist. 

And that meant, inevitably, a geographic 
bias in ancient DNA research. “It’s really been 
a bit Eurocentric,” says Hannes Schroeder, a 
palaeogeneticist at the Natural History 
Museum of Denmark in Copenhagen. 

Now, after 30 years, researchers have 
returned to warmer places — this time find- 
ing more than contamination. Last week, a 


team published Africa's first ancient genome
(M. Gallego Llorente et al. Science http://doi.org/78d;
2015), from a 4,500-year-old skeleton
in Ethiopia, and the genetic profiles of dozens 
of 8,000-year-old skeletons from the Middle 
East were reported at a meeting. These results
reveal lost migrations and colour in
the details of murky prehistoric events far
beyond Europe and Asia.

“There are so many more fascinating
things happening in human world history
than just in Europe,” says Schroeder, who is
using ancient DNA to trace the origins of Caribbean
slaves. Breakthroughs in the past decade
have made it possible to study the very short
and damaged DNA fragments typically found
in remains from warmer places. Researchers
have also found that a tiny part of the inner
ear, the petrous portion of the temporal bone,
holds much greater amounts of DNA than other
human bones. “We find a crazy concentration of
ancient DNA in this region,” says Ron Pinhasi,
an archaeologist at University College Dublin 
(R. Pinhasi et al. PLoS ONE 10, e0129102; 2015). 

Pinhasi and colleagues analysed DNA 
from the petrous of 26 individuals who 
lived in northwestern Anatolia (present-day 
Turkey) around 8,000 years ago, about the 
time that farming spread from the Middle 
East into Europe. The analysis suggests that 
they belonged to the source of a migration into
southeast and central Europe that displaced 
local hunter-gatherers, the team reported on 
9 October at a meeting of the American Soci- 
ety of Human Genetics in Baltimore, Maryland 
(I. Mathieson et al. Preprint at bioRxiv at
http://doi.org/4wt; 2015).

Pinhasi also turned to the petrous bone 
after getting access to a 4,500-year-old human 
skeleton excavated in 2012 in the Mota cave in 
the highlands of Ethiopia. As reported in Sci- 
ence last week, he and his colleagues recovered 
enough DNA to sequence the ancient Ethiopi- 
an’s genome 12 times over, producing the first 
complete genome from an ancient African. 

The man’s genome is more closely related to 
present-day Ethiopian highlanders known as 
the Ari than to any other population the team 
examined. And using genetic evidence from 
Eurasian ancient genomes and present-day 
Eurasian and African populations, the research- 
ers determined that the Ari descend from Mid- 
dle Eastern farmers. They suggest that after 
expanding into Europe, some of these people 
later moved south to Africa, bringing new crops 
(see http://dx.doi.org/79n; 2015). 

“Africa is the next step in terms of the pop- 
ulation history of humans. It is an obvious 
place to go and look,” says Eske Willerslev, an 
evolutionary geneticist at the Natural History 
Museum of Denmark. He says that his team 
has also had success in obtaining DNA from 
African remains, including some that have 
not been protected by the relatively cool con- 
fines of a high-altitude cave. Willerslev expects 
that researchers will eventually strike lucky in 
extracting DNA from African remains that are 
many tens of thousands of years old, rather than 
a few thousand. 

Some researchers speculate that remains 
around 100,000 years old from the Skhul and 
Qafzeh caves in Israel might represent failed 
migrations out of Africa that preceded the 
global expansion of Homo sapiens by tens of 
millennia, whereas others disagree. Ancient 
DNA may be the only way to settle the debate. 

And few skeletons provoke such a debate 
among ancient-DNA researchers as Homo flo- 
resiensis, the mysterious ‘hobbit’ fossil discov- 
ered a decade ago on the Indonesian island of 
Flores. Past efforts to reap DNA from its bones 
have failed, yet its genetics may be the only way 
to determine its evolutionary relationship to H. 
sapiens. “I think there's a shot with floresiensis,”
Pinhasi says. “I think that has to be tested.”


Additional reporting by Sara Reardon. 
Experiments to redefine
kilogram converge at last 


After a fraught few years, results agree in time to meet a 2018 deadline. 


BY ELIZABETH GIBNEY 


For decades, metrologists have strived to
retire ‘Le Grand K’ — the platinum and
iridium cylinder that for 126 years has
defined the kilogram from a high-security
vault outside Paris. Now it looks as if they at
last have the data needed to replace the cylinder
with a definition based on mathematical
constants.

The breakthrough comes in time for the
kilogram to be included in a broader redefinition
of units — including the ampere, mole and
kelvin — scheduled for 2018. And this week,
the International Committee for Weights and
Measures (CIPM) will meet in Paris to thrash
out the next steps.

“It is an exciting time,” says David Newell, a
physicist at the US National Institute of Standards
and Technology (NIST) in Gaithersburg,
Maryland. “It is the culmination of intense,
prolonged efforts worldwide.”

The kilogram is the only SI unit still based
on a physical object. Although experiments
that could define it in terms of fundamental
constants were described in the 1970s, only in
the past year have teams using two completely
different methods achieved results that are
both precise enough, and in sufficient agreement,
to topple the physical definition.

Redefinition will not make the kilogram
more precise, but it will make it more stable.
A physical object can lose or gain atoms over
time, or be destroyed, but constants remain
the same. And a definition based on constants
would, at least in theory, allow the exact kilogram
measure to be available to someone
anywhere on the planet, rather than just those
who can access the safe in France, says Richard
Davis, former head of the mass division of the
International Bureau of Weights and Measures
(BIPM) in Sèvres, France, which hosts
the metal kilogram.

A replica of the kilogram mass reference, which is set to be replaced by a definition based on constants.

In 2011, the CIPM formally agreed to
express the kilogram in terms of Planck’s constant,
which relates a particle’s energy to its
frequency, and, through E = mc², to its mass.
This means first setting the Planck value using
experiments based on the current reference
kilogram, and then using that value to define
the kilogram. The CIPM’s committee on mass
recommends that three independent measurements
of Planck’s constant agree, and that two
of them use different methods.
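The link between Planck’s constant and mass described above can be written out in two lines. This is only an illustrative sketch: the symbols ν (frequency) and m (equivalent mass) are introduced here for the example, and the real experiments reach h by far more elaborate routes than this textbook relation.

```latex
% Illustrative only: \nu and m are notation assumed for this sketch.
\begin{align}
  E &= h\nu   && \text{(Planck relation: energy and frequency)} \\
  E &= mc^{2} && \text{(mass-energy equivalence)} \\
  \Rightarrow\quad m &= \frac{h\nu}{c^{2}}
\end{align}
```

Because the speed of light c has had a fixed value since the 1983 definition of the metre, fixing the numerical value of h would tie a mass to a frequency, which is why an agreed value of h can stand in for the metal cylinder.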

One method, pioneered by an international 
team known as the Avogadro Project, involves 
counting the atoms in two silicon-28 spheres 
that each weigh the same as the reference kilo- 
gram. This allows them to calculate a value 
for Avogadro’s constant, which the research- 
ers convert into a value for Planck’s constant. 

Another method uses a device called a watt
balance to produce a value for Planck’s
constant by weighing a test mass calibrated
according to the reference kilogram against
an electromagnetic force.

Reaching agreement proved difficult. In 
early 2011, some researchers contemplated 
simply averaging measurements from the 
two different devices (see Nature http://doi.org/cjzwdn;
2011). “I think every metrologist
worried, ‘What if they never converge?’”
says Davis.

Such a fudge did not prove necessary, 
thanks to three years of intense work, says 
Joachim Ullrich, president of the German 
National Metrology Institute (PTB) in Braun- 
schweig, which coordinates the Avogadro 


Project, and chair of the CIPM’s Consultative 
Committee for Units. The first sign of pro- 
gress came after the Measurement Science 
and Standards laboratory in Ottawa, part of 
Canada’s National Research Council (NRC), 
bought and rebuilt a watt balance originally 
constructed at the UK National Physical Labo- 
ratory in Teddington. 

In a new lab, a fresh NRC team factored in 
some predicted but as yet unaccounted for 
systematic errors, and the result, published in
January 2012, inched closer to the Avogadro 
Project's silicon-sphere result. 

That still left the result from NIST as an 
outlier, says Newell, chair of the international 
CODATA committee’s group on fundamen- 
tal constants, which provides a best value for 
constants such as Planck's every four years by 
taking into account the results of all the experi- 
ments done so far. “We brought in a whole new 
research team, we went over every component, 
went through every system,” he says. They 
never found the cause for the disagreement, 
but in late 2014 the NIST team achieved a
match with the other two, who in the mean- 
time had shrunk their relative uncertainties to 
within the required levels. 

In August 2015, when CODATA
published its latest value for Planck’s constant,
the uncertainty was 12 parts per billion,
just over one-quarter of its value in CODATA’s
previous report — and within the CIPM’s
requirements.

The CIPM will discuss its next moves during 
its meeting at the BIPM on 15 and 16 October. 
This will include a discussion of the draft reso- 
lution that is expected to redefine the ampere, 
mole, kelvin and kilogram at the General Con- 
ference on Weights and Measures in 2018. The 
BIPM is still working on a protocol that will 


allow teams without access to a watt balance 
or silicon-sphere set-up to use a new kilogram 
definition. 

There is still scope for upset. The teams have 
until 1 July 2017 to publish further data before 
the value of Planck’s constant is fixed. Before 
this deadline, Ullrich’s team plans to use a new 
batch of spheres from Russia in experiments 
that he hopes will lead to even more-certain 
values for Planck’s constant, but could cause 
the results to diverge again. “Then we would be 
in trouble,” he says. “But I’m very confident this


will not happen.” Newell agrees: “This train has 
a lot of momentum and there has to be something
seriously wrong to derail it.”

If they are proved right, in 2018, Le Grand K 
will join the metre as a museum piece. “We'll 
keep it,” says Davis, “but it won't be defining 
anything anymore.”
CLIMATE CHANGE


Firms that suck carbon 
from air go commercial 


Two companies announce that they are expanding and upgrading their plants. 


BY DANIEL CRESSEY 


It has long been regarded as one of the more
blue-skies solutions to climate change. Now
two companies have vastly increased their
capability to suck carbon dioxide from the air.
One, based in Canada, plans to convert captured
CO₂ into diesel to fuel buses; the other,
in Switzerland, will sell it on to a firm that uses
CO₂ to boost crop growth in greenhouses.
The carbon emissions that this will save
are not significant. But David Keith, executive
chairman of the Canadian firm, Carbon
Engineering in Calgary, and a climate physicist
at Harvard University in Cambridge, Massachusetts,
says that his company’s air-capture
plant will position the technology to be further
scaled up. Most significantly, he says, the plant
will now run the whole process — from CO₂
capture to regeneration — for the first time.

Carbon Engineering’s demonstration plant in British Columbia captures carbon dioxide from the air.

Others are excited by the development.
“The fact they're getting to commercial-scale
prototypes is incredibly encouraging,” says
Noah Deich, executive director of the Center
for Carbon Removal in Berkeley, California.
More than a dozen facilities worldwide,
including oil refineries and power plants,
already capture millions of tonnes of CO₂ from
the flue gases they expel. The idea of capturing
carbon directly from the atmosphere — where
CO₂ is present in much lower concentrations
than in flue gases and so is harder to extract
— has been around for several years, but only
in the form of small, demonstration projects.

On 9 October, Carbon Engineering officially
opened a new plant in Squamish, British
Columbia, that can capture and process
around 1 tonne of CO₂ per day — about the
same as a typical car might emit when driven
about 5,000 kilometres. This represents a big
step up from the company’s earlier demonstration
plant, which ran only the first step of
capture and did not regenerate gaseous CO₂.

The plant uses fans to push air through
towers containing potassium hydroxide solution,
which reacts with CO₂ to form potassium
carbonate; the remaining air, now containing
less CO₂, is released. Further treatment of the
solution separates out the captured CO₂, regenerating
the capture solution for reuse. These
processes are currently powered by electricity,
which in British Columbia is mainly generated
by hydroelectric sources, says Keith. Initially,
the company will re-release the captured CO₂,
but Carbon Engineering announced last week
that it had signed a Can$435,000 (US$333,000)
deal with the province of British Columbia to
assess the potential of turning the CO₂ into fuel
to power local buses.
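The capture step described above can be summarized with the textbook overall reaction between potassium hydroxide and carbon dioxide. The balanced equation and the mass ratio below are standard chemistry assumed for illustration; the article does not give the plant's detailed process chemistry.

```latex
% Textbook overall reaction, assumed here; not a detail given in the article.
\[
  2\,\mathrm{KOH} + \mathrm{CO_2} \longrightarrow \mathrm{K_2CO_3} + \mathrm{H_2O}
\]
% KOH consumed per unit mass of CO2 captured (molar masses about 56.11 and 44.01 g/mol):
\[
  \frac{m_{\mathrm{KOH}}}{m_{\mathrm{CO_2}}} = \frac{2 \times 56.11}{44.01} \approx 2.55
\]
```

On these assumptions, binding 1 tonne of CO₂ ties up roughly 2.5 tonnes of potassium hydroxide, which suggests why regenerating the capture solution for reuse matters to the process.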

Meanwhile, the Swiss company, Climeworks
in Zurich, announced at a UK meeting on
greenhouse-gas capture in Oxford earlier this
month that it plans to start capturing CO₂ on
a commercial scale. Its plant in Hinwil, Switzerland,
will capture 1,000 tonnes of CO₂ per
year starting in mid-2016, according to Anca
Timofte, a process engineer at the company.

In some ways the technology is similar to
that of Carbon Engineering, but Climeworks
will instead use granules to soak up the CO₂,
using a module that will sit on top of an incineration
plant. (The technology is still classed as
air capture because the material will scrub CO₂
from air near the plant rather than from the
expelled gases.) Waste heat from the incinerator
will be used to drive the captured CO₂ off
the granules, which can then be reused.

The company has arranged to sell CO₂
produced in this way to the firm Gebrüder
Meier, which will use it to increase crop yields 
in greenhouses. Climeworks is also assessing 
the beverage industry as a source of potential 
customers, says Timofte. 

If such companies are to scale up further 
and make money, one challenge will be finding
buyers for their CO₂, says Tim Kruger, a
geoengineer at the University of Oxford, UK,
who organized the Oxford meeting and runs a
company, Origen Power, that hopes to generate
carbon-negative energy. And it is not clear
whether companies will be able to produce CO₂
or related products at a price that is competitive 
enough to attract a wide pool of clients. 

David Keith, chairman of Carbon Engineering.

In 2011, a report from the American Physical
Society (APS) estimated that air capture
would cost at least US$600 per tonne of CO₂,
assuming a large system that removed 1 million
tonnes of CO₂ per year. But Climeworks says
that its price will be in that range in the first
year of its plant’s operation, despite being on
a smaller scale than the APS example. The
company also expects that cost to fall as the
technology develops. Keith, meanwhile, says
that the CO₂ produced by Carbon Engineering’s
plant is expensive, but emphasizes that
it is a pilot; he says that prices of $100–200
per tonne of CO₂ are realistic for the bigger
iterations that it is planning.

Even if the companies cannot compete on 
price with conventionally manufactured CO₂
(which can be as low as tens of dollars per
tonne but can be significantly higher), there are
other factors that could help to create demand
for air-captured CO₂. The introduction of a
carbon tax could incentivize big emitters to
pay other companies to mop up their CO₂ to
avoid paying the tax. And if the world is ever 
to become completely carbon neutral, air cap- 
ture will have a part to play, says Nilay Shah, an 
engineer working on low-carbon technologies 
at Imperial College London. 

Efforts to mitigate climate change should 
focus on capturing CO₂ at the source. But there
are many scenarios in which pre-emission capture
is not viable. “Once you start to get into
things like capturing carbon from vehicles
or from household boilers, that’s much more
expensive,” says Shah. “You may well be better
off capturing CO₂ from the air.”

Keith emphasizes that his company is not 
trying to fix climate change on its own. “Air 
capture has been stuck in a catfight between 
one group of people saying it’s a silver bullet 
and one group saying it’s bullshit,” he says. 
“The truth is it’s neither.”


DNA-repair sleuths win 
chemistry Nobel 


Tomas Lindahl, Paul Modrich and Aziz Sancar share prize for work on how DNA heals itself. 


BY DANIEL CRESSEY 


The 2015 Nobel Prize in Chemistry was
awarded last week to three researchers 
for their work on DNA repair. 

Tomas Lindahl, Paul Modrich and Aziz 
Sancar “mapped, at a molecular level, how 
cells repair damaged DNA and safeguard the 
genetic information”, said the Royal Swedish
Academy of Sciences in Stockholm, which 
awards the prize. 

DNA is not a stable molecule, but slowly 
decays over time. For life to exist — as Lindahl 
first realized while working at the Karolinska 
Institute in Stockholm in the 1970s — there 
must be repair mechanisms that fight back 
against this process. 

Numerous scientists have since chroni- 
cled the many ways in which damaged DNA 
is patched up, says Stephen West, who works 
on DNA repair at the Francis Crick Institute 
in London, where Lindahl is now an emeritus
group leader. “The DNA-repair field is a large
field,” says West. “Many of us thought a Nobel
would not go to this field because there are so 
many people with a claim to the prize.” 

But the three repair mechanisms recognized 
with the Nobel prize “are probably the three 
most important and best-understood 
mechanisms”, he says, adding that the awards
are “fantastically well deserved”. 


REPAIR JOBS 

Lindahl, who is regarded as one of the found- 
ers of the field, chronicled a process dubbed 
base excision repair, in which specific enzymes 
recognize, cut out and patch up bases in the 
DNA molecule. Before his work, “I don’t think 
anybody really considered the idea that DNA 
requires active engagement by a set of house- 
keeping processes to keep it in a stable state,” 
says Keith Caldecott, who studies DNA repair 
at the University of Sussex in Brighton, UK, 
and did postdoctoral work with Lindahl. 


Sancar (who was born in Savur, Turkey,
but has spent most of his professional life in the
United States and is now at the University of
North Carolina at Chapel Hill) worked in the
1980s to explain how cells use enzymes to repair
damage to DNA from ultraviolet rays or other
carcinogens, through a system called nucleotide
excision repair.

And in 1989, Modrich, who is at Duke University
School of Medicine in Durham, North
Carolina, published work on a third mechanism,
‘mismatch repair’, which deals with
errors produced when DNA is copied.

This September, the prestigious Albert 
Lasker Basic Medical Research Award was also 
awarded for work on how cells correct damage 
to DNA. But it went to two other researchers: 
Evelyn Witkin of Rutgers University in New
Brunswick, New Jersey, and Stephen Elledge
of Brigham and Women’s Hospital in Boston,
Massachusetts.

Tomas Lindahl, Paul Modrich and Aziz Sancar share the 2015 Nobel Prize in Chemistry.


WIDER IMPACTS 

Speaking to reporters in Stockholm at the 
Nobel press conference, Lindahl noted that 
understanding DNA repair has implications 
for human health. People with faults in their 
repair system have an increased risk of developing
cancers, because damaging mutations
can go uncorrected. Cancer cells themselves
survive damage by using enzymes to patch up
DNA, and there is now interest in therapies that
target DNA-repair pathways in tumour cells.
“We need DNA repair but we don’t like it that
the cancer cells have DNA repair,” Lindahl said.

Work in the field has had an impact in other 
areas, too. Lindahl’s research proved influential 
in the 1980s and 1990s, when scientists were 
first working to extract and analyse ancient 
DNA. The patterns of DNA damage that he
first characterized are now used as a stamp of
authenticity to show that DNA is ancient, and
not modern contamination.

The chemistry prize follows the award of 
the Nobel Prize in Physiology or Medicine to 
William Campbell, Satoshi Ōmura and Youyou
Tu for their work on therapies against parasitic 
infections. The physics Nobel went to Takaaki 
Kajita and Arthur McDonald for showing that 
neutrinos have mass.


CORRECTIONS 

The News story ‘Neutrino flip wins physics
prize’ (Nature 526, 175; 2015) wrongly
implied that physicists knew about all three
types of neutrino in the 1960s. In fact, the tau
neutrino was postulated only in the 1970s.

The News Feature ‘The impenetrable proof’
(Nature 526, 178–181; 2015) wrongly
located the University of Antwerp in the
Netherlands. It is, of course, in Belgium.

In the News Feature ‘The mitochondria
mystery’ (Nature 525, 444–446; 2015), the
quote “The standards for a shampoo seem
to be harsher” was erroneously attributed
to Ted Morrow. When he said these words,
he was characterizing the stance of another
researcher, not expressing his own opinion,
and the quote should not have been included.
THE LANDSCAPE
FOR HUMAN
GENOME EDITING


BY HEIDI LEDFORD


A view of international regulations
suggests where in the world a
CRISPR baby could be born.


They are meeting in China; they are meeting in the United
Kingdom; and they met in the United States last week. Around
the world, scientists are gathering to discuss the promise and perils
of editing the genome of a human embryo. Should it be allowed and,
if so, under what circumstances?

The meetings have been prompted by an explosion of interest in
the powerful technology known as CRISPR/Cas9, which has brought
unprecedented ease and precision to genetic engineering. This tool,
and others like it, could be used to manipulate the DNA of embryos in a
dish to learn about the earliest stages of human development. In theory,
genome editing could also be used to ‘fix’ the mutations responsible for
heritable human diseases. If done in embryos, this could prevent such
diseases from being passed on.

The prospects have prompted widespread concern and discussion
among scientists, ethicists and patients. Fears loom that if genome
editing becomes acceptable in the clinic to stave off disease, it will
inevitably come to be used to introduce, enhance or eliminate traits for
non-medical reasons. Ethicists are concerned that unequal access to
such technologies could lead to genetic classism. And targeted changes
to a person’s genome would be passed on for generations, through the
germ line (sperm and eggs), fuelling fears that embryo editing could
have lasting, unintended consequences.

Adding to these concerns, the regulations in many countries have not
kept pace with the science.

Nature has tried to capture a snapshot of the
legal landscape by querying experts and government
agencies in 12 countries with histories of
well-funded biological research. The responses
reveal a wide range of approaches. In some
countries, experimenting with human embryos
at all would be a criminal offence, whereas in
others, almost anything would be permissible.

Concerns over the manipulation of human
embryos are nothing new. Rosario Isasi, a legal scholar at McGill University
in Montreal, Canada, points to two key waves of legislation over the
years: one sparked by concerns about the derivation of embryonic stem
cells, which was largely deemed acceptable; the other about reproductive
cloning, which was largely prohibited for safety reasons.

The current regulatory mosaic is their legacy. Tetsuya Ishii, a
bioethicist at Hokkaido University in Sapporo, Japan, spent nearly a
year analysing relevant legislation and guidelines in 39 countries, and
found that 29 have rules that could be interpreted as restricting genome
editing for clinical use (M. Araki and T. Ishii Reprod. Biol. Endocrinol.
12, 108; 2014). But the ‘bans’ in several of these countries, including
Japan, China and India, are not legally
binding. “The truth is, we have guidelines
but some people never follow them,” said
Qi Zhou, a developmental biologist at the
Chinese Academy of Sciences Institute of
Zoology in Beijing, at a meeting hosted by the
US National Academy of Sciences in Washington
DC last week. Ishii considers the rules
in nine other countries, among them Russia
and Argentina, to be “ambiguous”. The United States, he notes,
prohibits federal funding for research involving human embryos, and
would probably require regulatory approval for human gene editing, but
does not officially ban the use of the technique in the clinic. In countries
where clinical use is banned, such as France and Australia, research is
usually allowed as long as it meets certain restrictions and does not
attempt to generate a live birth (see ‘CRISPR embryos and the law’).

Many researchers long for international guidelines that, even if
not enforceable, could guide national lawmakers. Developing such a
framework is one of the aims of ongoing discussions; the US National
Academy, for example, plans to hold an international summit in December
and then produce recommendations for
responsible use of the technique in 2016.

But the research has already begun, and
more is coming. Scientists in China announced
in April that they had used CRISPR to alter the
genomes of human embryos, albeit ones incapable
of producing a live baby (P. Liang et al.
Protein Cell 6, 363–372; 2015). Xiao-Jiang Li, a
neuroscientist at Emory University in Atlanta,
Georgia, who has used the technique in monkeys, says he has heard
rumours that several other Chinese laboratories are already doing such
experiments. And in September, developmental biologist Kathy Niakan
of the Francis Crick Institute in London applied to the UK Human Fertilisation
and Embryology Authority for permission to use the technique
to study errors in embryo development that can contribute to infertility
and miscarriage. No one so far has declared an interest in producing
live babies with edited genomes, and initial experiments would suggest
that it is not yet safe. But some suspect that it is only a matter of time.


CRISPR EMBRYOS AND THE LAW

Regulations governing genetic modification
in human embryos vary. Some countries
ban the practice through legislation that
carries criminal penalties; others have
unenforceable guidelines.

THE UNITED STATES does
not allow the use of federal
funds to modify human
embryos, but there are no
outright genome-editing
bans. Clinical development
may require approval.

ARGENTINA bans reproductive
cloning, but research applications
of human-genome editing are not
clearly regulated.

THE UNITED KINGDOM’s
independent Human
Fertilisation and
Embryology Authority
may permit human-genome
editing for
research, but the practice
is banned in the clinic.

GERMANY has strict laws on the use
of embryos in assisted reproduction.
It also limits research on human
embryos, and violations could result
in criminal charges.

JAPAN, like China, India and Ireland,
has unenforceable guidelines that
restrict the editing of a human
embryo’s genome.

Ishii predicts that countries with high rates of in vitro fertiliza- 
tion will be the first to attempt clinical applications. Japan, he says, 
has one of the highest numbers of fertility 
clinics in the world, and has no enforceable 
rules on germline modification. The same is 
true for India. 

Guoping Feng, a neuroscientist at the 
Massachusetts Institute of Technology in 
Cambridge, hopes that with improvement, the 
technique could eventually be used to prevent 
genetic disease. But he argues that it is much 
too soon to be trying it in the clinic. “Now is not the time to do human-embryo
manipulation,” he says. “If we do the wrong thing, we can send
the wrong message to the public, and then the public will not support
scientific research anymore.”


Heidi Ledford writes for Nature from Cambridge, Massachusetts. 
Neuroscientists are probing the
connections between intestinal
microbes and brain development.


BY PETER ANDREY SMITH


Nearly a year has passed since
Rebecca Knickmeyer first met 
the participants in her latest 
study on brain development. 
Knickmeyer, a neuroscientist at 
the University of North Carolina 
School of Medicine in Chapel 
Hill, expects to see how 30 newborns have 
grown into crawling, inquisitive one-year-olds, 
using a battery of behavioural and temperament 
tests. In one test, a child’s mother might disap- 
pear from the testing suite and then reappear 
with a stranger. Another ratchets up the weird- 
ness with some Halloween masks. Then, if all 
goes well, the kids should nap peacefully as a 
noisy magnetic resonance imaging machine 
scans their brains. 

“We try to be prepared for everything,” 
Knickmeyer says. “We know exactly what to do 
if kids make a break for the door.”

Knickmeyer is excited to see something else 
from the children — their faecal microbiota, the 
array of bacteria, viruses and other microbes 
that inhabit their guts. Her project (affection- 
ately known as ‘the poop study’) is part of a 
small but growing effort by neuroscientists to
see whether the microbes that colonize the
gut in infancy can alter brain development.

The project comes at a crucial
juncture. A growing body of data,
mostly from animals raised in sterile,
germ-free conditions, shows
that microbes in the gut influence
behaviour and can alter brain physiology
and neurochemistry.

In humans, the data are more limited.
Researchers have drawn links between
gastrointestinal pathology and psychiatric
neurological conditions such as
anxiety, depression, autism, schizophrenia
and neurodegenerative disorders, but they
are just links.

“In general, the problem of causality in 
microbiome studies is substantial,” says Rob
Knight, a microbiologist at the University of 
California, San Diego. “It’s very difficult to tell 
if microbial differences you see associated with 
diseases are causes or consequences.” There are 
many outstanding questions. Clues about the 
mechanisms by which gut bacteria might inter- 
act with the brain are starting to emerge, but no 
one knows how important these processes are 
in human development and health. 

That has not prevented some companies in 
the supplements industry from claiming that
probiotics — bacteria that purportedly aid 
with digestive issues — can support emotional 
well-being. Pharmaceutical firms, hungry for 
new leads in treating neurological disorders, 
are beginning to invest in research related to gut 
microbes and the molecules that they produce. 

Scientists and funders are looking for clarity. 
Over the past two years, the US National Insti- 
tute of Mental Health (NIMH) in Bethesda, 
Maryland, has funded seven pilot studies 
with up to US$1 million each to examine 
what it calls the ‘microbiome–gut–brain axis’
(Knickmeyer’s research is one of these studies). 
This year, the US Office of Naval Research in 
Arlington, Virginia, agreed to pump as much 
as US$52 million over the next few years into 
work examining the gut’s role in cognitive func- 
tion and stress responses. And the European 
Union has put €9 million (US$10.1 million) 
towards a five-year project called MyNew- 
Gut, two main objectives of which target brain 
development and disorders. 

The latest efforts aim to move beyond basic 
observations and correlations — but prelimi- 
nary results hint at complex answers. Research- 
ers are starting to uncover a vast, varied system 
in which gut microbes influence the brain 
through hormones, immune molecules and 
the specialized metabolites that they produce. 

“There’s probably more speculation than 
hard data now,” Knickmeyer says. “So there's a 
lot of open questions about the gold standard 
for methods you should be applying. It’s very 
exploratory.”


Microbes and the brain have rarely been 
thought to interact except in instances when 
pathogens penetrate the blood-brain barrier 
— the cellular fortress protecting the brain 
against infection and inflammation. When they 
do, they can have strong effects: the virus that 
causes rabies elicits aggression, agitation and 
even a fear of water. But for decades, the vast 
majority of the body’s natural array of microbes 
was largely uncharacterized, and the idea that it 
could influence neurobiology was hardly con- 
sidered mainstream. That is slowly changing. 

Studies on community outbreaks were one 
key to illuminating the possible connections. 
In 2000, a flood in the Canadian town of Walk- 
erton contaminated the town’s drinking water 
with pathogens such as Escherichia coli and 
Campylobacter jejuni. About 2,300 people suf- 
fered from severe gastrointestinal infection, and 
many of them developed chronic irritable bowel 
syndrome (IBS) as a direct result. 

During an eight-year study¹ of Walkerton
residents, led by gastroenterologist Stephen
Collins at McMaster University in Hamilton,
Canada, researchers noticed that psychological
issues such as depression and anxiety
seemed to be a risk factor for persistent IBS.
Premysl Bercik, another McMaster gastroenterologist,
says that this interplay triggered
intriguing questions. Could psychiatric symptoms
be driven by lingering inflammation, or
perhaps by a microbiome thrown out of whack
by infection?

The McMaster group began to look for
answers in mice. In a 2011 study², the team
transplanted gut microbiota between different
strains of mice and showed that behavioural
traits specific to one strain transmitted along
with the microbiota. Bercik says, for example,
that “relatively shy” mice would exhibit more
exploratory behaviour when carrying the
microbiota of more-adventurous mice. “I think
it is surprising. The microbiota is really driving
the behavioural phenotype of host. There’s
a marked difference,” Bercik says. Unpublished
research suggests that taking faecal bacteria
from humans with both IBS and anxiety and
transplanting it into mice induces anxiety-like
behaviour, whereas transplanting bacteria from
healthy control humans does not.

Such results can be met with scepticism. As 
the field has developed, Knight says, microbi- 
ologists have had to learn from behavioural sci- 
entists that how animals are handled and caged 
can affect things such as social hierarchy, stress 
and even the microbiome. 

And these experiments and others like them 
start with a fairly unnatural model: germ-free,
or ‘gnotobiotic’, mice. These animals
are delivered by Caesarean section to prevent 
them from picking up microbes that reside in 
their mothers’ birth canals. They are then raised 
inside sterile isolators, on autoclaved food and 
filtered air. The animals are thus detached from 
many of the communal microbes that their spe- 
cies has evolved with for aeons. 

In 2011, immunologist Sven Pettersson and 
neuroscientist Rochellys Diaz Heijtz, both at the 
Karolinska Institute in Stockholm, showed that 
in lab tests, germ-free mice demonstrated less- 
anxious behaviour than mice colonized with 
natural indigenous microbes³. (Less anxiety is
not always a good thing, evolutionarily speak- 
ing, for a small mammal with many predators.) 

When the Karolinska team examined the 
animals’ brains, they found that one region 
in germ-free mice, the striatum, had higher 
turnover of key neurochemicals that are asso- 
ciated with anxious behaviour, including the 
neurotransmitter serotonin. The study also 
showed that introducing adult germ-free mice 
to conventional, non-sterile environments 
failed to normalize their behaviour, but the off- 
spring of such ‘conventionalized’ mice showed 
some return to normal behaviour, suggesting
that there is a critical window during which 
microbes have their strongest effects. 

By this time, many researchers were intrigued 
by the mounting evidence, but results stemmed 
mostly from fields other than neuroscience. 
“The groups working on this are primarily gut 
folks, with a few psychology-focused people 
collaborating,” says Melanie Gareau, a physiolo- 
gist at the University of California, Davis. “So 
the findings tended to describe peripheral and 
behavioural changes rather than changes to the 
central nervous system.” 

But Pettersson and Diaz Heijtz’s research 
galvanized the field, suggesting that researchers 
could get past observational phenomenology 
and into the mechanisms affecting the brain. 
Nancy Desmond, a programme officer involved 
in grant review at the NIMH, says that the paper 
sparked interest at the funding agency soon 
after its publication and, in 2013, the NIMH 
formed a study section devoted to neurosci- 
ence research that aims to unravel functional 
mechanisms and develop drugs or non-invasive 
treatments for psychological disorders. 

Judith Eisen, a neuroscientist at the 
University of Oregon in Eugene, earned a grant 
to study germ-free zebrafish, whose transparent
embryos allow researchers to easily visualize
developing brains. “Of course, ‘germ-free’ is
a completely unnatural situation,” Eisen says.
“But it provides the opportunity to learn which
microbial functions are important for development
of any specific organ or cell type.”


CHEMICAL EXPLORATION 


Meanwhile, researchers were starting to uncover 
ways that bacteria in the gut might be able to 
get signals through to the brain. Pettersson and 
others revealed that in adult mice, microbial 
metabolites influence the basic physiology of 
the blood–brain barrier⁴. Gut microbes break
down complex carbohydrates into short-chain
fatty acids with an array of effects: the fatty acid
butyrate, for example, fortifies the blood–brain
barrier by tightening connections between cells
(see ‘The gut–brain axis’).

Recent studies also demonstrate that gut 
microbes directly alter neurotransmitter levels, 
which may enable them to communicate with 
neurons. For example, Elaine Hsiao, a biologist 
now at the University of California, Los Angeles,
published research⁵ this year examining how
certain metabolites from gut microbes pro- 
mote serotonin production in the cells lining the 
colon — an intriguing finding given that some 
antidepressant drugs work by promoting sero- 
tonin at the junctions between neurons. These 
cells account for 60% of peripheral serotonin in 
mice and more than 90% in humans. 

Like the Karolinska group, Hsiao found that 
germ-free mice have significantly less seroto- 
nin floating around in their blood, and she also 
showed that levels could be restored by intro- 
ducing to their guts spore-forming bacteria 
(dominated by Clostridium, which break down 
short-chain fatty acids). Conversely, mice with
natural microbiota, when given antibiotics, had 
reduced serotonin production. “At least with 
those manipulations, it’s quite clear there's a 
cause-effect relationship,” Hsiao says. 

But it remains unclear whether these altered 
serotonin levels in the gut trigger a cascade of 
molecular events, which in turn affect brain 
activity — and whether similar events take place 
in humans, too. “It will be important to replicate 
previous findings, and translate these findings 
into human conditions to really make it to the 
textbooks,” Hsiao says. 

For John Cryan, a neuroscientist at University 
College Cork in Ireland, there is little question 
that they will. His lab has demonstrated⁶ that
germ-free mice grow more neurons in a spe- 
cific brain region as adults than do conventional 
mice. He has been promoting the gut-brain axis 
to neuroscientists, psychiatric-drug researchers 
and the public. “If you look at the hard neurosci- 
ence that has emerged in the last year alone, all 
the fundamental processes that neuroscientists 
spend their lives working on are now all shown 
to be regulated by microbes,” he says, pointing
to research on the regulation of the blood-brain 
barrier, neurogenesis in mice and the activation 
of microglia, the immune-like cells that reside 
in the brain and spinal cord. 

At the 2015 Society for Neuroscience meeting 
in Chicago, Illinois, this month, Cryan and his 
colleagues plan to present research showing that 
myelination — the formation of fatty sheathing 
that insulates nerve fibres — can also be influ- 
enced by gut microbes, at least in a specific part 
of the brain. Unrelated work⁷ has shown that
germ-free mice are protected from an experi- 
mentally induced condition similar to multiple 
sclerosis, which is characterized by demyeli- 
nation of nerve fibres. At least one company, 
Symbiotix Biotherapies in Boston, Massachu- 
setts, is already investigating whether a metabo- 
lite produced by certain types of gut bacterium 
might one day be used to stem the damage in 
humans with multiple sclerosis. 


A MOVE TO THERAPY 

Tracy Bale, a neuroscientist at the University of 
Pennsylvania in Philadelphia, suspects that sim- 
ple human interventions may already be war- 
ranted. Bale heard about Cryan’s work on the 
radio programme Radiolab three years ago. At 
the time, she was researching the placenta, but 
wondered how microbes might fit into a model 
of how maternal stress affects offspring. 

In research published this year⁸, Bale
subjected pregnant mice to stressful stimuli. 
She found that it noticeably reduced the levels 
of Lactobacilli present in the mice’s vaginas, 
which are the main source of the microbes that 
colonize the guts of offspring. These microbial 
shifts carried over to pups born vaginally, and 
Bale detected signs that microbiota might affect 
neurodevelopment, especially in males. 

In work that her group plans to present at
the Society for Neuroscience meeting, Bale has
shown that by feeding vaginal microbiota from
stressed mice to Caesarean-born infant mice,
they can recapitulate the neurodevelopmental
effects of having a stressed mother. Bale and
her colleagues are now wrapping up research
investigating whether they can treat mice from
stressed mums with the vaginal microbiota of
non-stressed mice.

The work, Bale says, has “immediate translational
effects”. She points to a project headed
by Maria Dominguez-Bello, a microbiologist at
the New York University School of Medicine, in
which babies born by means of Caesarean section
are swabbed on the mouth and skin with
gauze taken from their mothers’ vaginas. Her
team wants to see whether these offspring end
up with microbiota similar to babies born vaginally.
“It’s not standard of care,” Bale says, “but I
will bet you, one day, it will be.”

Many are still sceptical about the link
between microbes and behaviour and whether
it will prove important in human health, but
scientists seem more inclined to entertain the
idea now than they have in the past. In 2007, for
example, Francis Collins, now director of the
US National Institutes of Health, suggested that
the Human Microbiome Project, a large-scale
study of the microbes that colonize humans,
might help to unravel mental-health disorders.
“It did surprise a few people who assumed we
were talking about things that are more intestinal
than cerebral,” Collins says. “It was a little
bit of leap, but it’s been tentatively backed up.”

Funding agencies are supporting the
emerging field, which spans immunology,
microbiology and neuroscience, among other
disciplines. The NIMH has offered seed funding
for work on model systems and in humans to
probe whether the area is worth more-substantial
investment, a move that has already brought
more researchers into the fold. The MyNewGut
project in Europe has an even more optimistic
view of the value of such research, specifically
seeking concrete dietary recommendations that
might alleviate brain-related disorders.


THE GUT–BRAIN AXIS

The mechanisms by which gut microbes
and the brain might communicate are
unclear, but there are several tantalizing
leads for researchers to follow.

Gut microbes, including bacteria
and viruses

1. PERIPHERAL SEROTONIN:
Cells in the gut produce large
quantities of the neurotransmitter
serotonin, which may have an effect on
signalling in the brain.

2. IMMUNE SYSTEM:
The intestinal microbiome can
prompt immune cells to produce cytokines
that can influence neurophysiology.

3. BACTERIAL MOLECULES:
Microbes produce metabolites such as
butyrate, which can alter the activity of
cells in the blood–brain barrier.


Today, Knickmeyer’s project on infants 
represents what she calls “a messy take-all-com- 
ers kind of sample”. Among the brain regions 
that Knickmeyer is scanning, the amygdala 
and prefrontal cortex hold her highest interest; 
both have been affected by microbiota manip- 
ulations in rodent models. But putting these 
data together with the dozens of other infant 
measures that she is taking will be a challenge. 
“The big question is how you deal with all the 
confounding factors.” The children’s diets, home 
lives and other environmental exposures can all 
affect their microbiota and their neurological 
development, and must be teased apart. 

Knickmeyer speculates that tinkering with
microbes in the human gut to treat mental-health
disorders could fail for other reasons.
Take, for instance, how microbes might interact
with the human genome. Even if scientists
were to find the therapeutic version of a “gold
Cadillac of microbiota”, she points out, “maybe
your body rejects that and goes back to baseline
because your own genes promote certain types
of bacteria.” There is much more to unravel, she
says. “I’m always surprised. It’s very open. It’s a
little like a Wild West out there.”


Peter Andrey Smith is a reporter based in 
New York City. 
COMMENT


POLICY Eight
ways to get the best


AI Three books agree that
human psychology is a barrier
to robot superiority


MUSIC In conversation with
the creator of a chamber
opera exploring time


OBITUARY William Paul,
discoverer of interleukin-4,
remembered


Price carbon — I will
if you will


To forge a strong climate accord in Paris, nations must agree on a common goal
in everyone’s self-interest, say David J. C. MacKay and colleagues. 


Negotiations at the United Nations
climate summit in Paris this December
will adopt a ‘pledge and review’
approach to cutting global carbon emissions.
Countries will promise to reduce their emissions
by amounts that will be revised later.
The narrative is that this will “enable an
upward spiral of ambition over time”. History
and the science of cooperation predict
that quite the opposite will happen.


Climate change is a serious challenge 
because the atmosphere gives a free ride to 
countries that emit. If some nations sit back 
and rely on others’ efforts, the incentives for
anyone to act are weakened. Review of the 
first phase of the Kyoto Protocol at the 2012 
UN climate meeting in Doha, for instance, 
resulted in Japan, Russia, Canada and New 
Zealand leaving the agreement, frustrating 
those who kept their promises. 


Success requires a common commitment,
not a patchwork of individual ones.
Negotiations need to be designed to realign
self-interests and promote cooperation.
A common commitment can assure participants
that others will match their efforts
and not free-ride. A strategy of “I will if you
will” stabilizes higher levels of cooperation.
It is the most robust pattern of cooperation
seen in laboratory and field studies of
situations open to free-riding.

A global carbon price — so far excluded 
from consideration in international negotia- 
tions — would be the ideal basis for a com- 
mon commitment in our view. A price is 
easy to agree and handle, relatively fair, less 
vulnerable to gaming than global cap-and- 
trade systems, and consistent with climate 
policies already in place, such as fossil-fuel 
taxes and emissions cap-and-trade. 

Only a common commitment can lead to 
a strong treaty. Forty years of empirical and 
theoretical literature on cooperation confirms 
that individual commitments do not deliver 
strong collective action. Cooperators find that 
defectors take advantage of them. Ambition 
declines when others are revealed to be
free-riding. Dishes often stack up in the sinks of
shared apartments. But in the Alps, villagers 
have successfully managed shared land for 
hundreds of years, with a common commitment
governing grasslands.


COMMON COMMITMENT 

Imagine that you and nine other self-inter- 
ested players (representing countries) take 
part in a game. Each player has $10, some or 
all of which they may simultaneously pledge 
to a common pot. A referee makes sure that
they honour their pledges. Every dollar (for 
carbon dioxide abatement) placed in the pot 
will be doubled (by climate benefits) and 
distributed evenly to all players. So putting 
a dollar in the pot will return 20 cents to 
each player. 

Consider two variants of the game. First, 
in the ‘individual commitment’ version, 
pledges are made independently. This is 
the classic public-goods game, in which the 
rational selfish strategy is to contribute noth- 
ing, because this makes a player better off no 
matter what the others do. The result is the 
famous tragedy of the commons. Coopera- 
tion does not occur, even though everyone 
would gain from it. 

Second, in the ‘common commitment’ 
version, players condition their contribu- 
tions on others’ pledges: a referee ensures 
that all contribute the amount of the lowest 
pledge. After enforcing this common com- 
mitment, the money is doubled and distrib- 
uted evenly, exactly as before. 

This changes everything. Pledging 
$0 will mean simply keeping your $10, 
whereas pledging $10 could result in end- 
ing up with anything between $10 and 
$20, depending on what others pledge. So, 
because you cannot lose and could gain by 
pledging $10, that is what you would do, 
even if you are completely selfish. Since all 
parties would pledge $10, the group's $100 
is doubled and all end up with the maxi- 
mum amount of $20. 

Selfish behaviour has been changed from
‘contribute nothing’ to ‘contribute everything’,
because the common commitment
protects against free-riding.
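
The two variants of the game can be checked with a few lines of arithmetic. The sketch below is only an illustration of the rules as described above (ten players, $10 each, pot doubled and shared evenly); the function names are hypothetical and chosen for this example.

```python
def payoffs(contributions, endowment=10.0, multiplier=2.0):
    """Final holdings: money kept plus an equal share of the multiplied pot."""
    n = len(contributions)
    share = multiplier * sum(contributions) / n
    return [endowment - c + share for c in contributions]


def individual_commitment(pledges):
    """Variant 1: every player contributes exactly what they pledged."""
    return payoffs(list(pledges))


def common_commitment(pledges):
    """Variant 2: the referee holds every player to the lowest pledge."""
    lowest = min(pledges)
    return payoffs([lowest] * len(pledges))


others = [10.0] * 9  # the nine other players all pledge $10

# Individual commitments: free-riding beats cooperating.
print(individual_commitment([0.0] + others)[0])   # 28.0
print(individual_commitment([10.0] + others)[0])  # 20.0

# Common commitment: pledging $0 only pulls everyone down to $10.
print(common_commitment([0.0] + others)[0])       # 10.0
print(common_commitment([10.0] + others)[0])      # 20.0
```

Under individual commitments the player who pledges nothing ends with $28 against $20 for cooperating, so contributing nothing is the selfish choice. Under the common commitment a $10 pledge can never leave a player below $10, which is why the selfish choice flips.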

In 1997, the Kyoto negotiators initially
did try to agree a common commitment,
expressed as a formula for national emissions
caps, but failed. In the end, each nation
was simply asked to submit their final numbers
for insertion into the draft annex. The
result was a patchwork of weak and unstable
commitments. Similarly, in response to the
2009 Copenhagen Accord, China pledged
emissions equal to those considered “business
as usual” before the accord; and India
pledged even less.

Enforcement is widely thought to be the 
missing ingredient in the Kyoto Protocol and 
crucial for the success of a Paris agreement. 
This is only half right — both enforcement 
and a common commitment are required. 
For example, if drivers chose their own speed 
limits, there would be no use enforcing them, 
because everyone would drive at their desired 
speed. Instead, because it limits others as well, 
people agree to a common speed limit that 
is lower than almost everyone's individual 
limit. In other words, with individual com- 
mitments, there is nothing meaningful to 
enforce, whereas enforcement strengthens a 
common commitment. 

What could all countries commit to?
National limits on the quantity of emissions
will not work. Kyoto negotiators suggested at
least ten formulae to determine the reductions
that each nation should make, but could
not agree. When attention turned to reducing
emissions by some percentage relative to
1990 levels, individual commitments ranged
from an 8% decrease to a 10% increase.
The United States and developing countries
made no commitments at all.

Percentage pledges failed because coun- 
tries differ; for instance, some economies 
declined after 1990 and some grew. Devel- 
oping countries fear caps that curb their 
growth. Instead they see it as fair to allocate 
emission permits on an equal per capita 
basis. Because permit sales would result in 
huge wealth transfers to poor countries, rich 
countries find such proposals unacceptable.

There is no longer any serious discussion of 
a common commitment to reduce the quan- 
tity of carbon emissions. 


GLOBAL CARBON PRICE 

We, and others, propose an alternative: a 
global carbon-price commitment. Each
country would commit to place charges on 
carbon emissions from fossil-fuel use (by 
taxes or cap-and-trade schemes, for exam- 
ple) sufficient to match an agreed global 
price, which could be set by voting, by a
super-majority rule that would produce a 
coalition of the willing. 
A uniform carbon price is widely accepted
as the most cost-effective way to curb emis- 
sions. Carbon pricing is flexible, allowing 
fossil taxes, cap-and-trade, hybrid schemes 
and other national policies to be used (unlike 
a global carbon tax). All that is required of a 
country is that its average carbon price — cost 
per unit of greenhouse gas emitted — be at 
least as high as the agreed global carbon price. 
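
As a rough illustration of that test, the sketch below computes a country's average carbon price as total carbon charges divided by total emissions, and compares it with an agreed global price. The sector figures, the price levels and the function names are all hypothetical; how charges would actually be measured is not specified in the proposal.

```python
def average_carbon_price(sectors):
    """Emissions-weighted average price.

    `sectors` is a list of (emissions_in_tonnes, price_per_tonne) pairs;
    emissions that face no charge enter with a price of 0.
    """
    total_emissions = sum(e for e, _ in sectors)
    total_charges = sum(e * p for e, p in sectors)
    return total_charges / total_emissions


def meets_commitment(sectors, global_price):
    """True if the average price is at least the agreed global price."""
    return average_carbon_price(sectors) >= global_price


# Hypothetical country: a fuel tax, a cap-and-trade market, and an
# uncovered remainder (emissions in millions of tonnes, price in $/tonne).
country = [(300.0, 30.0), (500.0, 12.0), (200.0, 0.0)]

print(average_carbon_price(country))    # 15.0
print(meets_commitment(country, 14.0))  # True
print(meets_commitment(country, 20.0))  # False
```

The point of the example is that the mix of instruments is left to the country: only the average has to clear the agreed threshold.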

Unlike global cap-and-trade, carbon 
pricing allows countries to keep all carbon 
revenues, eliminating the risk of needing to 
buy expensive credits from a rival country. 
Taxes need not rise if a nation performs a 
green tax shift — reducing taxes on good 
things such as employment by charging for 
pollution. Shifting taxes from good things to 
bad things could mean there is no net social 
cost to pricing carbon, even before counting 
climate benefits.

A global price does not automatically 
result in acceptable burden sharing. A ‘Green 
Climate Fund’ will be needed to transfer 
funds from rich to poor countries. To mini- 
mize disputes, the objective of climate-fund 
transfers should be to maximize the global 
price of carbon. This can be implemented 
in a way that encourages rich countries to 
be generous and poor countries to vote for a 
higher global carbon price, for example, by
making all climate-fund payments propor- 
tional to the agreed carbon price. 

After decades of failure, a fresh approach 
is needed — one that is guided by the science 
of cooperation. A common price commit- 
ment would harness self-interest by aligning 
it with the common good. Nothing could be 
more fundamental.


Use experts 
wisely 


Policymakers are ignoring evidence on how advisers 
make judgements and predictions, warn 
William J. Sutherland and Mark A. Burgman. 


Many governments aspire to
evidence-based policy and
practice. The predominant,
conventional approaches to using experts
are either to seek the advice of one highly
regarded individual, or to convene a panel
with diverse expertise across relevant areas.
For example, quarantine services worldwide
routinely rely on expert judgement to estimate
the probability of entry, establishment
and spread of pests and diseases.

The accuracy and reliability of expert opinions
are, however, compromised by a long list
of cognitive frailties. Estimates are influenced
by experts’ values, mood, whether they stand
to gain or lose from a decision, and by the
context in which their opinions are sought.
Experts are typically unaware of these subjective
influences. They are often highly credible,
yet they vastly overestimate their own objectivity
and the reliability of their peers.

Happily, a large and growing body of
literature describes methods for engaging
with experts that enhance the accuracy and
calibration of their judgements. Unhappily,
these methods are rarely used to support
public-policy decisions. All the methods
strive to alleviate the effects of psychological
and motivational bias; all structure the
acquisition of estimates and associated
uncertainties; and all recommend combining
independent opinions. None relies on
the opinion of the best-regarded expert or
uses unstructured group consensus.

The cost of ignoring these techniques — 
of using experts inexpertly — is less-accurate 
information, and thus more frequent and 
more serious policy failures. 


KNOWNS AND UNKNOWNS 

For an important subset of questions, expert 
technical judgements about facts play a
part in policy and decision-making. (We 
appreciate that political context may deter- 
mine what comprises relevant, convinc- 
ing evidence, and that that evidence rarely 
leads directly to policy and action because 
decision-makers must balance a range of 
political, social, economic, practical and 
scientific issues.) 

Policymakers use expert evidence as 
though it were data. So they should treat 
expert estimates with the same critical rigour 
that must be applied to data. Experts must 
be tested, their bias minimized, their accu- 
racy improved, and their estimates validated 
with independent evidence (see ‘Eight ways 
to improve expert advice’). That is, experts 
should be held accountable for their opinions. 

For example, experts who are confident 
and routinely close to the correct answer pro- 
vide more information than do experts who 
regularly deviate from the correct answer or 
are under-confident. Highly regarded experts 
are routinely shown to be no better than nov- 
ices at making judgements. Opinions from 
more-informative experts can be weighted
more heavily, whereas the opinions of some
experts may be discarded altogether. These
strategies will illuminate where advice is
robust, and where it is contradictory, self-serving
or misguided. This will generate
evidence for policy decisions that is more
relevant and reliable. Roger Cooke, a risk-analysis
researcher at the Delft University of
Technology in the Netherlands, and his colleagues
have used this approach effectively to
better predict the implications of policy for
transport and nuclear-power safety.
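
To make the idea of weighting concrete, here is a deliberately simplified sketch. It is not the method Cooke and his colleagues use; it assumes a much cruder rule, invented for this example, in which each expert's weight falls with their average relative error on test questions whose answers are known, and experts below a chosen weight are dropped. All names and numbers are hypothetical.

```python
def performance_weights(test_answers, truths, min_weight=0.0):
    """Weights from test questions with known (non-zero) answers.

    Assumed scoring rule: raw weight = 1 / (1 + mean relative error).
    Experts whose normalized weight is below `min_weight` are discarded
    and the remaining weights are renormalized to sum to 1.
    """
    raw = {}
    for expert, answers in test_answers.items():
        errors = [abs(a - t) / abs(t) for a, t in zip(answers, truths)]
        raw[expert] = 1.0 / (1.0 + sum(errors) / len(errors))
    total = sum(raw.values())
    weights = {e: w / total for e, w in raw.items()}
    kept = {e: w for e, w in weights.items() if w >= min_weight}
    kept_total = sum(kept.values())
    return {e: w / kept_total for e, w in kept.items()}


def weighted_estimate(estimates, weights):
    """Combine the retained experts' estimates of the unknown quantity."""
    return sum(weights[e] * estimates[e] for e in weights)


truths = [100.0, 50.0]                 # answers to two test questions
test_answers = {
    "A": [100.0, 50.0],                # always right
    "B": [150.0, 75.0],                # 50% off
    "C": [300.0, 150.0],               # 200% off
}
estimates = {"A": 10.0, "B": 20.0, "C": 40.0}  # the real question

all_weights = performance_weights(test_answers, truths)
trimmed = performance_weights(test_answers, truths, min_weight=0.2)
print(weighted_estimate(estimates, all_weights))
print(weighted_estimate(estimates, trimmed))
```

With these made-up numbers the pooled estimate moves towards expert A as the worst-performing expert is first down-weighted and then discarded, which is the behaviour the paragraph describes.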

Experts themselves must make explicit 
the sensitivity of their decisions to scien- 
tific uncertainty, assumptions and caveats. 
When invited to advise, they should demand 
that state-of-the-art techniques are used to 
harvest and process what they offer. If not, 
all involved risk wasting substantial time, 
resources and opportunities. 

Importantly, all parties must be clear 
about what they expect of each other: esti- 
mates of facts, predictions of event outcomes 
or advice on the best course of action. Prop- 
erly managed, experts can help with the 
first two. Providing advice assumes that the
expert shares the same values and objectives 
as the decision-makers. 

Several processes have been shown to 
improve experts’ performances on esti- 
mates of facts and predictions of event 
outcomes. In using specialists to weigh up 
the best course of action, the researchers 
themselves, and the policymakers using 
them, should identify all possible changes, 
options and threats, a process known as 
horizon scanning. Policymakers must list all 
possible known solutions using a wide group 
of experts and reference to the literature, to 
reduce the risk that valuable alternatives are 
overlooked (known as solution scanning).

Deliberations should be underpinned by a
systematic collection of evidence, an assess- 
ment of its relevance, and an identification 
of the knowledge gaps that might change the 
decision. The information can be collated 
so that it is ready for use — rather than in 
response to a policy need — as is being done 
for biodiversity (see www.conservationevidence.com).


RULES OF ENGAGEMENT 

A few more rules of engagement, routinely 
applied, will enhance the quality and reliabil- 
ity of expert judgements. 

Ensure that questions are fully specified 
and unambiguous, so that language-based 
uncertainties do not cloud judgements. For 
example, a seemingly straightforward ques- 
tion such as ‘How many diseased animals 
are there in the region?’ could be interpreted 
differently by different people. The question 


does not specify whether to include only 
those animals that are known to be infec- 
tious, or also those that have died, have 
recovered, are diseased but yet to be identi- 
fied as such, and so on. 

Structured question formats counter
tendencies towards over-confidence for
individual estimates. For example, Andrew
Speirs-Bridge at La Trobe University has
shown that questions that elicit four
responses (upper and lower bounds, a best
guess and a degree of confidence in the
interval) generate estimates that are
relatively well calibrated.

Consider a range of scenarios and alternative
theories. Ensure that several experts
answer each question.
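
A four-response answer can be recorded and pooled along the following lines. This is a sketch under stated assumptions, not the procedure from the study cited above: the linear rescaling of each interval to a shared confidence level, the 80% target and the simple averaging across experts are choices made for this example, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FourPointEstimate:
    lower: float       # lowest plausible value
    upper: float       # highest plausible value
    best: float        # best guess
    confidence: float  # stated chance (0-1] that the truth lies in [lower, upper]


def standardize(est, target=0.8):
    """Stretch or shrink the interval around the best guess so that all
    experts' intervals refer to the same confidence level (assumed linear)."""
    scale = target / est.confidence
    return (est.best - (est.best - est.lower) * scale,
            est.best + (est.upper - est.best) * scale)


def pool(estimates, target=0.8):
    """Average the standardized bounds and the best guesses across experts."""
    intervals = [standardize(e, target) for e in estimates]
    n = len(estimates)
    return (sum(lo for lo, _ in intervals) / n,
            sum(e.best for e in estimates) / n,
            sum(hi for _, hi in intervals) / n)


answers = [
    FourPointEstimate(lower=40.0, upper=80.0, best=50.0, confidence=0.5),
    FourPointEstimate(lower=30.0, upper=70.0, best=50.0, confidence=0.8),
]
print(standardize(answers[0]))
print(pool(answers))
```

The first expert was only 50% sure of their interval, so it is widened before pooling; the second already answered at the target level and is left unchanged.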

Unstructured group interactions are sub- 
ject to ‘groupthink’: the group gravitates 
towards an initial or even an arbitrary esti- 
mate; dominant individuals drive the out- 
come; or individuals are ascribed greater 
credibility than they deserve because of their 
appearance, manner or professional back- 
ground. Structured, facilitated interactions 
counter factors such as these, which distort 
estimates.

LEVERAGE THE LITERATURE

Eight ways to improve expert advice

Use groups. Their estimates consistently
outperform those of individuals.

Choose members carefully. Expertise
declines dramatically outside an
individual’s specialization or experience.

Don’t be starstruck. Age, number of
publications, technical qualifications,
years of experience, memberships
of learned societies and apparent
impartiality do not explain an expert’s
ability to estimate unknown quantities
or predict events. This finding applies in
studies from nuclear-safety systems and
geopolitics to ecology.

Avoid homogeneity. Diverse groups tend to
generate more-accurate judgements.

Don’t be bullied. People who are less self-assured
and assertive, and who integrate
information from diverse sources tend to
make better judgements.

Weight opinions. Calibrate experts’
performance with test questions. This
improves risk estimates in many domains,
including earthquakes and nuclear-safety
systems.

Train experts. Training can improve experts’
abilities to estimate probabilities of events,
quantities or model parameters.

Give feedback. Chess players, weather
forecasters, sports people, gamblers,
intensive-care physicians and physicists
solving textbook problems generally make
accurate judgements, probably as a result
of rapid feedback from mistakes that are
visible and personal. Experts deserve
the same: give them immediate and
unambiguous feedback.


Review assumptions, reconcile misunderstandings
and introduce new information.
Ensure that decision-makers do not
rely on experts to choose between options
but rather use an appropriate decision tool.
One such is structured decision-making, in
which experts populate decision tables with
estimates of the expected outcomes for each
criterion under each policy option, but do
not decide the best option.

For example, an analysis of volcano-eruption
risks by Willy Aspinall, an Earth
scientist at the University of Bristol, UK, 
used structured interactions. These sub- 
stantially improved the quality of estimates, 
because he ensured that well-specified ques- 
tions were answered by several experts in 
such a way that he avoided or mitigated the 
psychological tripwires that compromise 
many group interactions. 

Similarly, a study led by conservation
ecologist Marissa McBride at the University
of Melbourne in Australia engaged with
groups of experts remotely, using structured 
questions and group interactions to assess 
the conservation status of threatened Aus- 
tralian birds. They used telephone confer- 
ences to outline the context and purpose of 
the interactions, which was to reassess the 
International Union for Conservation of 
Nature's Red List assessments for a suite of 
threatened species. They then used e-mail to 
elicit initial judgements and to clarify ques- 
tions, introduce further data and explana- 
tions. Finally, they circulated a spreadsheet 
and compiled a second round of private, 
anonymous judgements. 

In many cases, incorporating the formal 
stages described here will improve decision- 
making. The benefits are substantial improve- 
ments in the reliability of judgements, 
relatively free of personal biases and values. 
The costs in time and resources are modest.


William J. Sutherland is professor of 
conservation biology in the Department 
of Zoology, University of Cambridge, UK. 
Mark Burgman is professor of botany 
and managing director of the Centre of 
Excellence for Biosecurity Risk Analysis, 
School of BioSciences, University of 
Melbourne, Parkville, Australia. 

e-mail: wjs32@cam.ac.uk 
For a list of further reading on this topic, see
go.nature.com/ibrrqd. 


| COMMENT | BOOKS & ARTS 


Fear of robots overlooks their limitations, and potential. 


Countering singularity 
sensationalism 


Ken Goldberg reviews three books that probe the nexus 


of people and robots. 


Machines of Loving Grace: The Quest
for Common Ground Between
Humans and Robots
JOHN MARKOFF
Ecco: 2015.

Rise of the Robots: Technology and the
Threat of a Jobless Future
MARTIN FORD
Basic: 2015.

Our Robots, Ourselves: Robotics and the
Myths of Autonomy
DAVID A. MINDELL
Viking: 2015.


In the late nineteenth century, the United
States was awash with the racist term ‘yellow
peril’. Fears spread that Chinese immigrants
working in the country’s mines and
building its railways would seize more jobs
from the citizenry. Today, there is a similar
collective fear, this time about a ‘singularity’
in which artificial intelligence (AI) and robots
surpass human abilities. In May 2014, for
instance, physicists Stephen Hawking, Frank
Wilczek and Max Tegmark, with computer
scientist Stuart Russell, warned in the newspaper
The Independent: “Success in creating AI
would be the biggest event in human history.
Unfortunately, it might also be the last”.

Surprising advances are being achieved,
for example in ‘deep learning’, a method
for approximating complex functions using
thousands of numerical parameters. And
robots are evolving, with advances in 3D
sensing and mapping. But progress is not
nearly as steady as some claim. Three books
explore the topic from different perspectives.
All suggest that robot superiority faces a
formidable obstacle: human psychology.

In Machines of Loving Grace, reporter John
Markoff of The New York Times highlights
the compelling contrast between AI and
intelligence amplification (IA). He chronicles
the fascinating and often antagonistic
evolution of these fields since 1956, when
both terms were coined.

Markoff has been interacting with leading
researchers for the past 20 years. As he
shows, despite early optimism, creating AI
has turned out to be extremely difficult.
Robotics remains challenged by Moravec’s
paradox: tasks that are tough for humans,
such as precision spot welding, are easy
for robots, whereas tasks that are easy for
humans, like reliably clearing a dinner table,
remain extremely hard for robots. This is
mostly attributable to the inherent complexity
of friction, collisions and contact
mechanics. It is much easier to calculate the
precise trajectory of a comet than to predict
that of a coffee mug pushed across a tabletop.


320 | NATURE | VOL 526 | 15 OCTOBER 2015

© 2015 Macmillan Publishers Limited. All rights reserved

The term IA was preferred by Douglas 
Engelbart, who worked on the potential of 
computers to amplify human abilities. This 
evolved into the field of human-computer 
interaction, which brought us the mouse and 
graphical interfaces. Markoff recounts the 
stories of pioneers such as computer scien- 
tist Terry Winograd and augmented-reality 
expert Gary Bradski. Both recognized the 
limitations of AI, and became advocates of IA. 
Markoff’s book makes a strong case that the 
success of AI will depend on advances in IA. 

Rise of the Robots by software entrepreneur 
Martin Ford proclaims that AI and robots are 
about to eliminate most jobs, blue- and white- 
collar. A close reading reveals the evidence as 
extremely sketchy. Ford has swallowed the 
rhetoric of futurist Ray Kurzweil, and repeat- 
edly asserts that we are on the brink of vastly 
accelerating advances based on Moore’s law,
which posits that computing power increases 
exponentially with time. Yet some computer 
scientists rue this exponential fallacy, argu- 
ing that the success of integrated circuits has 
raised expectations of progress far beyond 
what historians of technology recognize as 
an inevitable flattening of the growth curve. 

Nor do historical trends support the 
Luddite fallacy, which assumes that there is 
a fixed lump of work and that technology 
inexorably creates unemployment. Such rea- 
soning fails to consider compensation effects 
that create new jobs, or myriad relevant fac- 
tors such as globalization and the democ- 
ratization of the workforce. Ford describes 
software systems that attempt to do the work 
of attorneys, project managers, journalists, 
computer programmers, inventors and 
musicians. But his evidence that these will 
soon be perfected and force massive lay-offs 
consists mostly of popular magazine articles 
and, in one case, a conversation with the 
marketing director of a start-up. 

In Our Robots, Ourselves, teleroboticist 
David Mindell points out that autonomous 
systems are not new. Since the 1970s, they 
have been in daily use at very low and very 
high altitudes, for deep-sea and space explo- 
ration and in almost all aircraft. Drawing 
on extensive experience, Mindell explains 
that although such systems have evolved, 
many experts continue to mistrust them. 
For example, there is a persistent school of 
thought that oceanographers must experi- 
ence the murky depths directly to under- 
stand the wonders lurking there. Yet robotic 
submarines, tele-operated through fibre- 
optic cables, are much more agile, are able to 
explore for longer and do not require expen- 
sive certification for every modification. 




Mindell recounts a telling scene. In 1977, 
oceanographer Robert Ballard of the Woods 
Hole Oceanographic Institution in Massa- 
chusetts and microbiologist Holger Jannasch 
descended in a submersible. They came 
across a sea-floor vent field. As Ballard looked 
through the glass portal at the “crab gradient” 
near the fissure’s source, he realized that Jannasch
had his back to it. When asked why,
Jannasch replied that the view was “better
here”, indicating the television image relayed
from the craft’s camera. In an ‘aha moment’,
Ballard realized that the view will always be
better at the surface, where high-resolution 
video from below can be viewed in comfort. 
Yet many continue to argue otherwise. 

Mindell looks at the compelling historical, 
cultural, political, psychological, philosophi- 
cal and public-relations justifications for 
keeping people in control. A recent example 
is the July 2015 international petition to ban 
autonomous weapons, signed by almost 3,000 
researchers. Consider, for instance, Google's 
announcement that it had found the most 
unreliable component in its autonomous 
cars: human drivers trying to take over. Its 
response was to remove the steering wheel. 
Mindell explains why this is a mistake. In 
what he calls the “myth of full autonomy”, he
points out that a machine may operate on its 
own for intervals, but that no machine works 
entirely independently: human intentions, 
assumptions and parameters are built into all 
machines. Mindell’s experience leads him to a
similar conclusion to Markoff’s: the essential 
(and most difficult) challenge is designing the 
interfaces that keep humans in the loop. 

Technological progress is not deter- 
ministic. There have been several cycles of 
irrational enthusiasm followed by disap- 
pointment and the evaporation of research 
and development funding, known as AI win- 
ters. The latest round of expectations feels 
similarly exaggerated. 

Alarmist writing may hasten the next 
slump and distract attention from a more 
realistic and important development, which 
we might call multiplicity. Multiplicity char- 
acterizes an emerging category of systems 
in which diverse groups of humans work 
together with diverse groups of machines to 
solve difficult problems, consistent with the 
points made by Markoff and Mindell. Multi- 
plicity thrives at the intersection of AI and 
IA, combining the wisdom of crowds with 
the power of cloud computing. As designer 
and computer scientist John Maeda has put 
it, it is not us versus the machines; it is us 
and the machines. There is much to gain by 
joining forces.


Ken Goldberg is professor of engineering 
and director of the People and Robots 
Initiative at the University of California, 
Berkeley. 

e-mail: goldberg@berkeley.edu 


Books in brief 


The Reproach of Hunger: Food, Justice and Money in the 21st 
Century 

David Rieff VERSO (2015) 

As refugee crises fill the news, David Rieff reminds us that hunger is a
war not won. Rieff, a veteran thinker on development issues, spent
six years researching the nexus of population, food commodification
and persistent poverty for this critical analysis. Scathing about
the alarmist or over-optimistic pronouncements of development
officials, agribusiness multinationals and philanthropic nabobs, he
notes that any issue involving billions of humans cannot be neatly
engineered. Thoughtful, trenchant and bracingly sceptical.


The End of Memory: A Natural History of Aging and Alzheimer’s 
Jay Ingram THOMAS DUNNE (2015) 

Alzheimer’s disease affects 5.3 million US citizens, and has so far 
eluded cure. In this deft overview, science writer Jay Ingram unravels 
the complexities of the science past and present. He examines the 
legacy of neurology pioneers such as Aloysius Alzheimer and Frederic 
Lewy; the biology of ageing and shifts in episodic and autobiographical 
memory; and the protein plaques and neurofibrillary tangles 
associated with the disease. And there is more, from the ongoing Nun 
Study of Aging and Alzheimer’s Disease (begun by neurologist David 
Snowdon in 1986) to the idea of Alzheimer’s as “type 3 diabetes”. 


Houston, We Have a Narrative: Why Science Needs Story 

Randy Olson UNIVERSITY OF CHICAGO PRESS (2015) 

Whether synthetic biology or exoplanet hunting, science told well 
can carry a thriller-like punch. Marine biologist turned filmmaker 
Randy Olson argues that narrative skill is central not just to science 
communication but also to research reportage, preventing false 
positives, yawn-worthy delivery and more. Olson prescribes the 
Hollywood formula “and, but, therefore” as the backbone of story, 
introducing momentum, conflict and resolution. He has packed his 
solid primer with analyses of how it is done, from James Watson’s 
1968 The Double Helix (Atheneum) to exemplary scientific abstracts. 


Memory and Movies: What Films Can Teach Us about Memory 
John Seamon MIT Press (2015) 

Cinema has long exploited the dramatic potential of memory. Here, 
John Seamon exploits film’s potential for elucidating neuroscience. 
Inspired by Christopher Nolan’s 2000 Memento (which hinges on 
anterograde amnesia), Seamon trains a cinematic lens on aspects 
of memory from facial recognition to dementia. Philip Kaufman’s 
1978 Invasion of the Body Snatchers, for instance, mirrors Capgras’ 
delusion, in which people believe that their acquaintances are 
doppelgangers; while Robert Redford’s Ordinary People (1980) 
dissects post-traumatic stress disorder with exquisite precision. 


Ornithological Photographs 

Todd Forsgren DAYLIGHT (2015) 

Photographer Todd Forsgren has spent years capturing images of 
birds, from hummingbirds to toucans, caught in mist nets — a tool 
widely used by ornithologists for ring-banding and data collection. 
Some may find the sight of immobilized birds in this collection 
disturbing. But Forsgren’s book uniquely showcases the birds’ 
individuality while testifying to the painstaking, ongoing work of field 
researchers striving to understand the ecology, population flux and 
more of wild birds. Barbara Kiser 
Refuse the Hour explores the unshakable anxiety of disappearing forever.


Q&A Peter Galison 
Time transformer 


Next week, Refuse the Hour, a chamber opera about time, opens at the Brooklyn Academy of 
Music in New York City. The work is a collaboration between physics historian Peter Galison 
and South African multimedia artist William Kentridge. Galison talks about the nexus of 
technology and imperial conquest, the ‘twin paradox’ associated with Einstein’s special theory of
relativity and the metaphorical resonance between black holes and mortality. 


How did this 
collaboration begin? 

I met William 
Kentridge through 
a mutual friend who 
was interested in fos- 
tering collaboration 
between science and 
art. What struck me 
was the probing psychological insight that 
William brought to the history of empire, 
how he depicted colonialism and its technical 
culture, surrounded by the terror of the his- 
torical moment. We began to focus on time. 
Early on, William said, “I don’t want to make 
an illustrated science lecture.” I said, “That’s 
great, because I don't want to be the science 
adviser to an art project.” Instead, we kept 
returning to scientific arguments about the 
nature of time and using them as jumping- 
off points — trying to capture, in affect and 
aesthetics, a sense of what gave them force. 


What will audiences see in Refuse the Hour? 
We originally had in mind an elaborate com- 
bination of installation and performance, but 
we then split the project into a five-channel 
video installation called The Refusal of Time, 
and an 80-minute chamber opera, Refuse the 
Hour. The opera is about the transformations
that we have witnessed in our cultural
and scientific ideas of
time: Isaac Newton's
absolute time, Albert
Einstein’s relative time
and the threatened end 
to time and space as one approaches the cen- 
tre of a black hole. When the opera makes its 
North American premiere, the audience will 
see projections of giant metronomes, African 
singers with megaphones, a giant accordion- 
like ‘breathing machine’ and William lectur- 
ing on procrastination, entropy and empire. 


Refuse the Hour 
22-25 October 2015. 
Brooklyn Academy of 
Music, New York City. 


How were you inspired by historical 
timekeeping? 

At the turn of the twentieth century, to meas- 
ure longitude well enough to map the world’s 
coastlines, colonial powers such as Britain 
needed to coordinate timekeeping across 
the globe. In a period when most countries 
lacked electric light, these powers were 
stringing telegraph cables under the ocean 
from Europe to Africa and South America. 
It was an extraordinary moment of scientific 
ambition and brute imperialism. 


Was there a backlash against the 
standardization of time? 

The opera shows both hope and resistance 
associated with advances in timekeeping. In 
the 1870s, authorities in Paris and Vienna 
tried to align the cities’ clocks by sending
pulses of air in copper tubes beneath the 
streets. Many people resisted. New York City 
mayor Franklin Edson argued for the con- 
ventionality of time, pushing back against 
those who fulminated against the loss of 
Sun-borne time. In Paris, the poet Georges de 
Porto-Riche sued the city because he believed 
the mechanical pulses had destroyed his 
creativity. The opera includes a filmed melo- 
drama about Martial Bourdin, a French anar- 
chist who, in 1894, tried to blow up the Royal 
Observatory in Greenwich, London, where 
the global standard time was kept. 


How does Einstein fit in? 

The fury was even greater when the young 
Einstein proposed the radical idea that time 
is not absolute. His 1905 special theory of rel- 
ativity implied that every person in motion 
carries a private time. This is encapsulated in 
the twin paradox that quickly emerged from 
Einstein's theory, in which a space traveller 
returns to Earth after a high-speed journey 
to find that his stay-at-home twin is older 
than he is. In the opera, we allude to these 
imagined twins, as well as time slowed and 
accelerated, reversed and put back forward. 
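
As a rough numerical illustration of the twin paradox described above, special relativity says that a clock moving at speed v runs slow by the Lorentz factor 1/sqrt(1 - v²/c²). The sketch below is not part of the interview; the trip length and speed are hypothetical, and the turnaround and acceleration phases of the journey are ignored.

```python
import math


def lorentz_factor(beta: float) -> float:
    """Return the Lorentz factor for a speed given as a fraction of light speed."""
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must satisfy 0 <= beta < 1")
    return 1.0 / math.sqrt(1.0 - beta * beta)


def traveller_years(earth_years: float, beta: float) -> float:
    """Time elapsed for the travelling twin while earth_years pass at home.

    Simplification: constant speed throughout, turnaround ignored.
    """
    return earth_years / lorentz_factor(beta)


if __name__ == "__main__":
    # Hypothetical trip: 10 years pass on Earth while the traveller moves at 0.8c.
    home = 10.0
    away = traveller_years(home, 0.8)
    print(f"stay-at-home twin ages {home:.1f} years, traveller ages {away:.1f} years")
```

With these illustrative numbers the traveller ages about six years while ten pass on Earth, which is the asymmetry the opera alludes to.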


How does your forthcoming film 
Containment take on the topic of time? 
Made with film-maker Robb Moss, 
Containment is a feature documentary about 
an extraordinary assignment demanded by 
the US government. To safely store nuclear 
waste underground, the Department of 
Energy has to decide how to warn future 
populations that the waste is there for a 
period not less than 10,000 years. That is 
a long span to consider: 10,000 years ago, 
prehistoric monument Stonehenge was 
science-fiction far in the future. 


What did you learn from working with 
Kentridge? 

One of the great pleasures of the collaboration 
has been starting with the science but follow- 
ing chains of association far beyond what I do 
as a physicist, historian or even film-maker. 
At one point, we began with a question that 
divides scientists today: if you throw an ency- 
clopaedia into a black hole, is the information 
gone forever or does it somehow survive? One 
day, inspired by a player-piano in William's 
studio, we realized that we could project light 
through rolls of perforated paper to invoke 
information falling into a black hole. We also 
made a parade of silhouettes marching into 
darkness, including two men struggling over 
a clock. Time is about physics, of course, but it 
is also, even for the most hardboiled scientist, 
about mortality, and the unshakable anxiety 
about disappearing forever.

This interview has been edited for length and clarity.

Climate justice more
vital than democracy 


Decision-making based on 
social-justice principles could be 
more effective than democratic 
efforts against climate change (see 
N. Stehr Nature 525, 449-450; 2015). 

Democratic decision-making 
involves multiple stakeholders, 
and democracy emphasizes 
the mutual roles of actors: all 
preferences are treated as equal. 
In many regions of the world, 
however, the results of democratic 
choices can be strongly 
influenced by power relations and 
inequitable social arrangements, 
owing to differences in economic 
development, access to 
technology and knowledge. 

Elites may use democratic 
processes to entrench their status 
or encroach on other social 
goals (B. Sovacool et al. Nature 
Clim. Change 5, 616-618; 2015). 
This can lead to incremental or 
undesirable results, which might 
explain why large democratic 
nations such as the United States 
continue to oppose progressive 
climate legislation. 

In our view, sound climate and 
energy planning should not treat 
all stakeholders in the same way. 
Instead, preferences and roles 
should be weighted to consider 
criteria related to equity, due 
process, ethics and other justice 
principles. This would ensure 
that stakeholder discussions 
and resulting policies serve to 
eradicate, rather than exacerbate, 
socio-economic vulnerability to a 
changing climate. 

Jingzheng Ren, Michael Evan 
Goodsite University of Southern 
Denmark, Odense, Denmark. 
Benjamin K. Sovacool Aarhus 
University, Herning, Denmark. 
benjaminso@auhe.au.dk 


Interdisciplinarity: 
two more principles 


As scientists in a sustainable- 
development research institute, 
our experience has demonstrated 
two further principles that
are crucial for successful
interdisciplinary research
(see R. R. Brown et al. Nature 525,
315-317; 2015).
First, solving real-world 
problems requires integration 
of the social and biophysical 
realms. Key to this is the explicit 
consideration of differences 
in scale. For example, water 
governance has a typical time 
horizon of years, yet hydrological 
processes operate on timescales 
from seconds to millennia. 
Second, a critical mass of 
interdisciplinary scholars is 
necessary to become a credible 
counterpart for disciplinary 
researchers and to catalyse 
progress. Our institute employs 
150 interdisciplinary researchers, 
who engage much greater numbers 
of single-discipline researchers in 
interdisciplinary projects. 
Martin J. Wassen, Marko P. 
Hekkert Utrecht University, 
the Netherlands. 
m.j.wassen@uu.nl 


Interdisciplinarity: 
topping the charts 


We offer ideas on why India 
ranks highest and Brazil fifth
— above the United States
and Europe — in terms of the
numbers of interdisciplinary 
research papers that they publish 
in Elsevier journals (see Nature 
525, 306-307; 2015). 

India’s researchers are
surrounded by dynamic 
ecological, social and economic 
problems. They therefore 
naturally turn to solution- 
oriented research that is 
informed by the complexity of 
their environment. Although 
interdisciplinary researchers still 
struggle for acceptance in India’s 
traditional, rigidly structured 
university departments, new 
universities with centralized 
programmes are offering fresh 
and tempting perspectives. 

In Brazil, the number of 
interdisciplinary graduate 
programmes is growing twice as 
fast as the number of disciplinary 
courses. The country’s national 
accreditation system changed
15 years ago, when many of its
master’s and PhD programmes
failed to fit any of their set
46 disciplinary slots.

Marcel Bursztyn University of 
Brasilia, Brazil. 

Seema Purushothaman Azim 
Premji University, Bangalore, India. 
marcel@unb.br 


Restore vaccine 
trust in Japan 


Vaccination coverage for human 
papilloma virus (HPV) in
Japan has plummeted following
unconfirmed media reports of 
severe adverse reactions such 
as chronic pain and cognitive 
decline (see S. J. B. Hanley et al.
Lancet 385, 2571; 2015). In 
response, and presumably to 
avoid lawsuits, the government 
has suspended its proactive 
vaccination recommendation. 

Several scandals in the past few 
decades have fuelled the Japanese 
public’s scepticism. These 
include health hazards associated 
with some drugs and vaccines, 
widely publicized cases of 
misconduct among researchers, 
and questionable connections 
between medical professionals 
and pharmaceutical companies 
(see T. Tanimoto et al. Nature 
512, 371; 2014). 

At the heart of the debate is 
Japan’s compensation scheme for 
vaccine injury, which depends 
on a causal relationship being
confirmed by the government. 
Claimants may also seek 
further compensation through 
private litigation against the 
government. In most countries, 
claimants must pursue one 
scheme or the other, not both. 

In our view, the government 
should introduce a no-fault 
compensation system, following 
the lead of other countries, 
including the United States 
and Scandinavian nations 
(see C. Looker and H. Kelly 
Bull. World Health Organ. 89, 
371-378; 2011). Exemption 
from liability would make for 
a fairer and more transparent 
judgement in policymaking and
pave the way for a new Japanese
vaccination programme in which 
public trust is restored. 

Tetsuya Tanimoto, Eiji Kusumi 
Navitas Clinic, Tokyo, Japan. 
Claire Leppold University of 
Edinburgh, UK. 
tetanimot@yahoo.co.jp 


Some rules for 
behavioural science 


US President Barack Obama's 
executive order of 15 September, 
‘Using behavioral science 
insights to better serve the 
American people’, is intended
to strengthen the policymaking 
process. To prevent the Obama 
administration from falling 
victim to agendas backed by 
pseudoscience, I suggest that
it should consider only studies
that abide by certain established 
rules. 

The rules were instituted in 
behavioural and experimental 
economics research in 
universities, largely in response 
to scientific misconduct by 
governments and academics. 
They stipulate that an 
independent ethics committee 
must review research involving 
human subjects before the 
start of any experiment; that 
participants should first 
give their informed consent; 
and that they should receive 
appropriate remuneration to 
ensure that their decisions reflect 
genuine preferences. Moreover, 
deception should never be used 
in experimental design or in the 
process of scientific enquiry. 

In the interests of scientific 
transparency, studies that lead 
to policy change should be peer 
reviewed and experimental 
design details and data made 
publicly available. 

Such guidelines would help to 
ensure that studies are relevant, 
rigorous, ethical and legal. They 
would also provide a means to 
replicate and test studies for bias, 
efficacy and accuracy. 

Jason A. Aimone Baylor 
University, Waco, Texas, USA. 
jason_aimone@baylor.edu 
OBITUARY


William E. Paul 


(1936-2015) 


A leading force in immunology. 


William Erwin Paul was a major
contributor to the development of modern
immunology. He helped to transform cytokine
biology, the study of small proteins 
involved in cell signalling, from crude 
assessments of uncharacterized cellular 
‘factors’ into a science involving precise 
quantitative molecular analyses. He also 
elucidated the mechanisms controlling 
the production of antibodies — proteins 
that recognize and bind to specific anti- 
gens such as bacteria and viruses — and 
provided insights into how antigens are 
recognized by T cells, a type of white 
blood cell. 

Paul, who died on 18 September, was 
born in 1936 in Brooklyn, New York. 
He prided himself on the fact that his 
higher education was at public rather 
than private institutions. He obtained 
his undergraduate degree in 1956 from 
Brooklyn College and his medical 
degree in 1960 from the State Univer- 
sity of New York’s Downstate Medi- 
cal Center, also in Brooklyn. While at 
medical school, he married Marilyn Heller. 

During a two-year medical residency at 
Massachusetts Memorial Hospitals (now 
Boston Medical Center), Paul worked on 
amyloidosis, a rare disease that occurs when 
a protein called amyloid builds up in tissues. 
This led to his first paper, in Nature. More 
than 600 publications would follow. 

Paul joined the US National Institutes of 
Health (NIH) in 1962. While working in 
the endocrinology branch of the National 
Cancer Institute in Bethesda, Maryland, he 
helped to establish that a chemotherapy drug 
called methotrexate was extremely effective 
in treating women with choriocarcinoma, a 
cancer that usually occurs in the placenta. He 
also helped to develop radioimmunoassays 
for hormones — in vitro techniques used to 
measure hormone levels in the blood. These 
studies were just a prelude to the real blos- 
soming of Paul’s research career. 

Paul had developed a strong interest in 
immunology as a student. He recounted in 
a memoir that he was smitten by “a slender 
volume of essays by Michael Heidelberger, 
the father of quantitative immunochemistry”,
which he had read while riding a tram in
Brooklyn. Paul sought advice from colleagues 
on the best places to train in immunology, 
and in 1964 joined Baruj Benacerraf’s group 
at New York University in New York City.
(Benacerraf later won the 1980 Nobel Prize
in Physiology or Medicine for his work on the 
genetics of the immune response.) 

Paul moved with Benacerraf to the NIH in 
1968. When Benacerraf left to become chair 
of pathology at Harvard Medical School in 
Boston in 1970, he encouraged the appoint- 
ment of Paul as his successor. Paul remained 
chief of the Laboratory of Immunology at 
the National Institute of Allergy and Infec- 
tious Diseases until his death. 

Under his leadership, the lab emerged as 
a premier centre of immunology research. 
This was a result of both Paul’s remark- 
able scientific accomplishments and his 
astute guidance of the department. He gave 
both support and independence to those 
he recruited to faculty positions (myself 
included). 

Paul is best known for his research on 
the cytokine interleukin-4 (IL-4), which he 
discovered with Maureen Howard, a post- 
doctoral fellow in his laboratory. He went 
on to show how the cytokine both helped 
to mobilize the body’s defence mechanisms 
against parasitic worms and played a part 
in the production of allergic symptoms. He 
also mapped out the IL-4-dependent sig- 
nalling pathway, the genetic control of the 
cytokine’s expression and the factors that 
prompted certain types of T cell to produce 
it. This all-encompassing and rigorous
approach became the prototype for 
studies on immune-system cytokines. 

Paul’s impact on science and health 
went further. As the first director of 
the NIH Office of AIDS Research from 
1994 to 1997, he set the agenda for 
research on HIV/AIDS in the United 
States. As Mark Harrington, executive 
director of an advocacy group called the 
Treatment Action Group, noted, Paul’s 
work led to “the emergence of highly 
active antiretroviral therapy, which is 
now being taken by over 15 million peo- 
ple around the world and is responsible 
for saving millions of lives and prevent- 
ing millions of infections”. 

Paul headed up several societies 
and served on innumerable advisory 
panels, prize committees and editorial 
boards. For 31 years, he served as edi- 
tor of the Annual Review of Immunol- 
ogy; for much of that time, it was the 
most cited publication in biomedicine. 
He was also widely known for his text- 
book Fundamental Immunology, first 
published in 1984. 

Bill was highly respected for being both a 
scholar and a gentleman. He could find flaws 
in any seminar he attended, but when asking 
questions, he made the speaker feel as if they 
were being praised, not skewered.

He remained his optimistic, ebullient, 
engaged self to the end. Only weeks before 
his death, he was calling colleagues to dis- 
cuss ongoing research; attending to his 
roles as mentor, colleague and administra- 
tor; and completing his book Immunity 
(Johns Hopkins Univ. Press; 2015), in which 
he explains to a lay audience how the same 
system that defends the body can also cause 
autoimmune diseases. 

His intellectual strength, encyclopaedic 
knowledge and laser-like focus on science 
— rather than on scientific politics — were 
unique. Friends, colleagues and immunologists
around the globe mourn his passing.

Decrypting a brain enigma


The combined neuronal activity of two seemingly opposite types of Purkinje cell in the brain’s cerebellum has been found 
to be required to control the jerky eye movements known as saccades in monkeys. SEE LETTER P.439 


KAMRAN KHODAKHAH 


The brain has tremendous information-processing
power and computational capacity.
Neuroscientists have made
great efforts to unravel how the brain encodes, 
decodes and processes information, but 
deconstructing these computations is a difficult
task. On page 439 of this issue, Herzfeld
et al.¹ use monkeys to masterfully unearth how
one of the brain’s most intriguing structures, 
the cerebellum, efficiently encodes — and per- 
haps subsequently decodes — the information 
needed to control the quick, jerky eye move- 
ments known as saccades, which occur as the 
eye explores a scene. 

Deciphering how the brain computes is 
exceptionally difficult, because it requires a 
reverse-engineering approach. Without any 
prior knowledge of how the brain’s circuits 
are designed, neuroscientists need to dissect 
these circuits to understand their function and 
computational principles. Consider the heroic 
efforts needed to reverse-engineer the Enigma 
machine, which was used during the Second 
World War to encrypt and decrypt military 
messages. By analogy, each of the brain’s different
computational units (neurons or neuronal
circuits that process the information contained 
in their inputs to determine appropriate out- 
puts) can be considered to be like an Enigma 
machine, with its own algorithm and code. 
Although Enigma’s purpose was to encrypt 
and subsequently decrypt the same message, 
brain circuits process the information that 
they receive and often completely transform 
it to generate a new message. Furthermore, we 
do not fully understand the different types of 
information processing that occur in different 
brain regions. 

However, this problem can be made more 
manageable by the thoughtful selection of 
specific brain regions to study. The more infor- 
mation there is about the inputs and outputs 
of a brain region, the details of its neuronal 
circuitry, the nature of the information it pro- 
cesses, and how and why it transforms the 
information, the better the chance of crack- 
ing its codes and understanding the purpose 
of its computations. Fortunately, because
different brain regions often use the same or
similar principles, understanding the code that
underlies one brain circuit can shed light on
similar computations in other regions.

Figure 1 | Wiring up saccades. Neuronal projections called climbing fibres that originate in a brain
region called the inferior olive encode the direction of saccades (quick, jerky eye movements that occur
when exploring a scene; directional tuning indicated by arrows). Climbing fibres send input to Purkinje
cells in the brain's cerebellum (although note that the activity of the Purkinje cells is primarily driven by
projections called mossy fibres, which are not shown). The activity of one type of Purkinje cell, called
burst cells, transiently increases with saccades (indicated by a + symbol), whereas the activity of another
type, pause cells, ceases when saccades begin (indicated by a - symbol). Herzfeld et al.¹ suggest that, in
monkeys, pause- and burst-cell signals are integrated by the nuclear neurons to which they signal to
coordinate the movement of the eye during saccades. In addition, the authors used computer simulations
to show that Purkinje cells that have climbing-fibre inputs with the same saccade directional tuning may
target the same nuclear neuron. In this way, a simple neuronal circuit controls both the direction and
movement of the eye during saccades. (Figure adapted from ref. 1.)

One region that has great potential for help- 
ing us to understand the brain is the cerebel- 
lum, which has a fairly simple anatomy. The 
cerebellum is mainly composed of repeats of 
the same computational circuit, at the core of 
which is a type of neuron called a Purkinje cell. 
Among other roles, the cerebellum controls 
and coordinates movements, and the anatom- 
ical connections involved in these functions 
are reasonably well delineated. These features, 
along with the ease with which the cerebellum’s 
motor outputs can be quantitatively moni- 
tored, make it an ideal structure for decipher- 
ing the computational principles of some of the 
brain’s Enigma machines (neuronal circuits). 

The cerebellum is essential for the accurate 
control of saccades, the jerky eye movements
that endow us with high-resolution vision by 
ensuring that the high-acuity region of the 
retina is exposed to the most important com- 
ponents of an image. Electrical recordings 
from Purkinje cells have long since revealed 
that the activity of some of these neurons
changes with saccades. But the individual
activity of each saccade-related Purkinje cell
is a poor predictor of saccade kinematics
— the moment-to-moment speed of the eye 
during saccades. This is in stark contrast to 
the cerebellar control of smooth eye pursuit, in 
which the activity of individual Purkinje cells 
reliably predicts, in real time, the movement of 
the eye as it follows a moving object.

The failure of individual Purkinje cells to 
accurately predict saccade kinematics has led 
to speculation that the combined population- 
wide activity of Purkinje cells might more 
effectively encode the required information.
Two types of saccade-related Purkinje cell have 
been identified: burst cells, whose activity 
transiently increases with saccades, and pause 
cells, whose spontaneous pre-saccade activ- 
ity ceases when saccades begin. Although the 
activity of pause cells has been largely ignored, 
the population-wide activity of burst cells has 
been examined and found not to be predictive
of saccade kinematics. By analysing the
activity of Purkinje cells in monkeys as the
animals made saccades, Herzfeld et al. made 
a breakthrough. They found that although 
neither the pause cells nor the burst cells pre- 
dicted saccades as an individual population, 
their combined activity accurately predicted 
saccadic eye movements. 
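
The idea that two populations can each be a poor predictor while their sum is a good one can be illustrated with a toy calculation. The sketch below uses entirely synthetic signals: the bump-shaped eye speed, the shared slow component, the baseline rate and all amplitudes are hypothetical choices, and nothing here reproduces the recordings or the analysis of Herzfeld et al. It only shows how a component that enters the burst and pause signals with opposite sign cancels when the two are added.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def bump(t, centre, width):
    """Gaussian-shaped transient used for all synthetic signals."""
    return math.exp(-((t - centre) / width) ** 2)


times = [float(i) for i in range(200)]                     # time in ms (toy values)
eye_speed = [400.0 * bump(t, 60.0, 15.0) for t in times]   # synthetic saccade speed
shared = [50.0 * bump(t, 90.0, 40.0) for t in times]       # slow component, opposite sign in each group
BASE = 60.0                                                # hypothetical baseline rate

burst = [BASE + 0.2 * v + s for v, s in zip(eye_speed, shared)]  # rate rises and outlasts the saccade
pause = [BASE - s for s in shared]                               # rate drops during the saccade
combined = [b + p for b, p in zip(burst, pause)]                 # shared component cancels

r_burst = pearson(burst, eye_speed)
r_pause = pearson(pause, eye_speed)
r_combined = pearson(combined, eye_speed)
print(f"burst r={r_burst:.2f}, pause r={r_pause:.2f}, combined r={r_combined:.2f}")
```

In this construction the combined signal is an exact linear function of eye speed, so the result is built in by design; the real finding is an empirical one about recorded neurons.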

How might the brain decode the information 
encoded by the activity of the Purkinje-cell 
population? Herzfeld and colleagues suggest 
a plausible mechanism based on the anatomi- 
cal organization of the cerebellum. Most of the 
cerebellar output is from clusters of neurons 
compacted into structures called cerebellar 
nuclei. Each nuclear neuron receives con- 
verging information from about 50 Purkinje 
cells. This organization, combined with the
speed at which Purkinje cells transfer information,
might enable nuclear neurons to faithfully
integrate the population-wide activity of
Purkinje cells in real time while preserving its 
temporal profile (Fig. 1). This possible decod- 
ing mechanism warrants careful experimental 
and theoretical scrutiny. 

How the cerebellum encodes the direction 
of saccades is another long-standing puzzle, 
and Herzfeld and colleagues again offer a solu- 
tion. Each Purkinje cell receives an input from 
a climbing fibre — the output projection of
neurons in the inferior olive, a brain region
outside the cerebellum. The activity of climbing
fibres is tuned to the direction of saccades.
The authors used computer simulations to 
explore the possibility that climbing fibres 
dictate the functional organization of the cere- 
bellar circuitry. They postulated that Purkinje 
cells that receive inputs from climbing fibres 
with the same directional tuning converge on 
the same nuclear neurons (Fig. 1). Remarkably, 
they found that when this organization is mod- 
elled, the activity of the Purkinje-cell popula- 
tion encodes both the real-time motion of the 
eye and the direction of the saccade, through a 
gain field — a multiplicative encoding mecha- 
nism found in the brain’s cortex. Although 
purely speculative, this is a graceful solution 
to a difficult puzzle and is well worth experi- 
mental validation. 
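
A gain field, in the multiplicative sense used above, can be sketched in a few lines: the time course of the response follows eye speed, and its overall size is scaled by a factor that depends on saccade direction. The cosine-shaped tuning curve, the preferred direction and the speed samples below are hypothetical illustrations, not the model used by Herzfeld and colleagues.

```python
import math


def direction_gain(direction_rad, preferred_rad):
    """Hypothetical cosine tuning scaled to [0, 1]: 1 at the preferred direction, 0 opposite."""
    return 0.5 * (1.0 + math.cos(direction_rad - preferred_rad))


def gain_field_response(speed_profile, direction_rad, preferred_rad):
    """Multiplicative code: the speed profile is scaled by a direction-dependent gain."""
    gain = direction_gain(direction_rad, preferred_rad)
    return [gain * speed for speed in speed_profile]


speed = [0.0, 100.0, 400.0, 100.0, 0.0]  # toy eye-speed samples (degrees per second)
preferred = 0.0                          # hypothetical preferred direction (radians)

for degrees in (0, 90, 180):
    response = gain_field_response(speed, math.radians(degrees), preferred)
    print(degrees, [round(r, 1) for r in response])
```

Because the direction term only rescales the response, the shape of the speed profile is preserved for every direction, which is what lets one signal carry both quantities.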

In addition to its role in motor control, 
the cerebellum has also been implicated in 
cognitive tasks. It is likely that many of the 
computational principles that the cerebellum 
uses for motor coordination are also imple- 
mented in its cognitive functions. It is equally 
likely that there are more, perhaps specialized, 
algorithms dedicated to its non-motor tasks. 
Some of the brain’s toughest and most elegant
Enigma machines remain to be cracked, and
the cerebellum might offer unique advantages
for tackling them.


The long future of
Antarctic melting


Simulations show that melting of the Antarctic ice sheet in response to climate
change could raise the global sea level by up to 3 metres by the year 2300 and
continue for thousands of years thereafter. SEE LETTER P.421


ALEXANDER ROBEL


Most projections of Antarctic ice
melt in response to climate change
extend a maximum of a few centuries
into the future, a timescale that has clear
relevance to immediate human affairs. But
to capture the total Antarctic contribution to
sea-level rise caused by climate change, it is
necessary to consider the possibility that ice-
sheet mass loss will continue for thousands of
years. On page 421 of this issue, Golledge and
colleagues¹ present multi-millennial ice-sheet
simulations in which Antarctica continues to
contribute significantly to global mean sea-level
rise for more than 1,000 years, long after ocean
and air temperatures have stopped increasing.
The authors’ simulations also show that ice-
shelf melting driven by ocean warming over the
next 100-300 years is a critical factor in deter-
mining the total future rise in global sea level.

Figure 1 | Ice-shelf collapse. These satellite images show the rapid break-up of 3,250 square kilometres of the Larsen B ice shelf in Antarctica in 2002. Golledge
et al.¹ have simulated the retreat of the Antarctic ice sheet in response to climate change over the next few millennia.

The spectre of the imminent and rapid
loss of ice from the West Antarctic Ice Sheet
has been raised by dramatic events such as
the collapse of the Larsen B ice shelf in 2002
(Fig. 1). Nevertheless, the contribution of the
Antarctic ice sheet to global sea-level rise is
currently small in comparison with that of
other sources, although it is increasing at an
accelerating rate.

The glaciologist John Mercer was the first
to suggest that past recessions of the West
Antarctic Ice Sheet were driven by fluctua-
tions in climate that increased air tempera-
tures over ice shelves to above freezing point.
He proposed that ice-shelf melting causes an
increase in the flow of ice from land that occurs
through other processes. In many coastal
regions of Antarctica, ice shelves are laterally
constrained by bays. Confirming Mercer’s 
suspicions, numerical models have since
shown that contact between the ice shelf and 
bay sidewalls exerts a restraining force on the 
flow of ice from land, leading to slower flow 
and reduced ice calving into the ocean. When 
the climate warms, ice shelves may melt, redu- 
cing the restraining contact with bay sidewalls, 
accelerating ice flow from land and enhancing 
mass loss through ice calving. 

Estimating the likelihood that climate 
change will cause widespread Antarctic ice- 
shelf loss, and the duration of ice-flow accel- 
eration in response to such a loss, is crucial for 
constraining future sea-level rise. However, 
accurately modelling ice sheets, ice shelves 
and their interactions with the ocean and 
atmosphere is computationally intensive. 
Consequently, most sea-level projections from 
models that explicitly include ice-sheet flow 
and ice-shelf melting extend only a few centu- 
ries into the future, or include only part of the 
Antarctic ice sheet. Existing multi-millennial 
projections of the ice sheet’s contribution to 
sea-level rise are based on relatively simple sta- 
tistical relationships between global tempera- 
ture and sea level derived from palaeoclimatic 
and instrumental data.
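
One common form of such a statistical relationship is a semi-empirical model in which the rate of sea-level rise is taken to be proportional to warming above a baseline temperature. The sketch below integrates that relation with a simple yearly step. The sensitivity `a_mm_per_yr_per_degC`, the baseline and the warming scenario are hypothetical values chosen for illustration; they are not taken from Golledge et al. or from the studies cited here.

```python
def integrate_sea_level(temps_degC, baseline_degC=0.0,
                        a_mm_per_yr_per_degC=3.0, dt_years=1.0):
    """Integrate dS/dt = a * (T - T0) with forward Euler steps.

    Returns the cumulative sea-level rise (mm) after each time step.
    All parameter defaults are illustrative assumptions.
    """
    level_mm = 0.0
    history = []
    for temp in temps_degC:
        level_mm += a_mm_per_yr_per_degC * (temp - baseline_degC) * dt_years
        history.append(level_mm)
    return history


# Hypothetical scenario: warming ramps linearly to 2 degC over 100 years,
# then stays constant for a further 200 years.
scenario = [2.0 * min(year, 100) / 100.0 for year in range(1, 301)]
rise = integrate_sea_level(scenario)
print(f"after 100 yr: {rise[99] / 1000:.2f} m, after 300 yr: {rise[-1] / 1000:.2f} m")
```

Even in this toy relation, sea level keeps rising after the temperature stops increasing, because the rate depends on how far the temperature sits above the baseline rather than on how fast it is changing.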

In contrast to earlier studies, Golledge and 
colleagues use a comprehensive ice-sheet 
model, with forcing of precipitation, ocean and 
air temperature from global climate models, 
to simulate mass loss from the Antarctic ice 
sheet from the present to the year 5000 under 
various scenarios of climate change. They find 
that ocean warming, rather than atmospheric 
warming or changes in precipitation, is the 
dominant driver of mass loss. Ocean warming 
causes loss of large Antarctic ice shelves and a
sustained acceleration in ice-sheet discharge 
into the ocean in all of the scenarios, except for 
one that includes the most extreme reduction 
in greenhouse-gas emissions compared with 
1990 emission levels. In the scenarios in which 
ice shelves are lost, the long-term contribution 
of the Antarctic ice sheet to global sea-level rise 
ranges from roughly 3 to 9 metres. The major- 
ity of that contribution comes after 2300, with 
enhanced rates of sea-level rise lasting until at 
least 3000 — long after ocean temperatures 
have stabilized. 

A previous theoretical analysis found that 
high model resolution is needed around the 
grounding line — which marks the transi- 
tion from a grounded ice sheet to a floating 
ice shelf — to simulate rapid grounding-line 
migration accurately. Golledge and co-workers 
instead used a fairly coarse model resolution 
near the grounding line, because of the com- 
putational constraints of performing multi- 
millennial simulations of the entire Antarctic 
ice sheet. 

For each climate-change scenario, they 
simulated an upper bound of sea-level rise 
using a scheme for correcting errors that arise 
as a result of the coarse resolution near the 
grounding line, and a lower bound that was 
calculated by not applying such corrections. 
The sensitive dependence of projections of 
sea-level rise on such correction schemes is 
vividly demonstrated by the gap of several 
metres between the simulated lower and upper 
bounds. Efforts to develop more-accurate ways 
of representing the grounding line that can be 
incorporated into multi-millennial ice-sheet 
simulations should continue, and will lead to 
a considerable reduction in uncertainties in 
long-term sea-level projections. 

Golledge et al. ultimately confirm the suspi- 
cions of earlier glaciologists that the fate of ice 
shelves largely determines whether Antarctica 
contributes less than 1 metre or up to 9 metres 
to long-term sea-level rise. Although ocean 
warming is responsible for most ice-shelf melting 
in these simulations, other studies have 
suggested that warm air temperatures lead to 
water ponding on the surface of ice shelves. 
This eventually causes hydrofracture (deepen- 
ing of fractures by the drainage of meltwater 
ponds) and the rapid break-up of ice shelves. 
Capturing such complex processes in models 
is difficult, but, in one study that included 
these effects, Antarctic ice-shelf loss is even 
more rapid and widespread than in the current 
simulations. If such a rapid ice-shelf break-up 
does occur, then Golledge and colleagues’ 
simulations might represent a best-case scenario 
for future sea-level rise. 
A bacterial nudge 
to T-cell function 


The epithelial cells that line the intestine have been found to sense tight 
attachment of bacteria, and to respond by producing proteins that shape 
the effector functions of the immune system’s TH17 cells. 


SHAI BEL & LORA V. HOOPER 


Our intestines contain trillions of 
bacteria that support health by pro- 
moting digestion, blocking invading 
microorganisms and synthesizing certain vita- 
mins. Although the epithelial cells that line the 
gut separate these bacteria from deeper tissues, 
resident bacteria influence the development of 
immune cells that reside beneath the epithe- 
lium. This presents a puzzle: how do bacteria 
that are confined to the gut lumen commu- 
nicate their presence to these immune cells? 
Writing in Cell, Atarashi et al.¹ and Sano et al.² 
show that the gut epithelial lining provides a 
conduit for this communication by sensing 
bacterial attachment and producing proteins 
that guide immune-cell development. 
Among the hundreds of bacterial species 
that live in the intestine, a select few are espe- 
cially adept at stimulating the immune system. 
One such bacterial group is the segmented 
filamentous bacteria (SFB). SFB have a 
distinctive appearance, being composed of 
long, segmented chains, or filaments. The 
filament ends are buried in the outward-fac- 
ing membranes of epithelial cells and thus the 
bacteria are attached tightly to the intestinal 
surface. Despite their menacing appearance 
under the microscope (Fig. 1a), the bacteria 
do not invade the epithelial barrier. But the 
tight attachment of SFB to the gut surface 
coincides with a remarkable ability to shape 
the development and function of subepithelial 
immune cells. These include TH17 cells, a 
key immune-cell group that is the focus of the 
present studies. 

Numerous cell types promote immunity to 
infection. ‘Helper’ T cells (TH cells), characterized 
by the presence of the CD4 protein on 
their surface, regulate immunity by producing 
cellular signalling molecules called cytokines. 
TH cells can differentiate into several lineages 
that differ in the cytokines they make and the 
infections that they target. TH17 cells produce 
the cytokines IL-17 and IL-22, which protect 
against extracellular pathogens and fungi. But 
TH17 cells also have a dark side: too many 
of them in the wrong place can provoke autoimmune 
and inflammatory diseases such as 
rheumatoid arthritis, multiple sclerosis and 
Crohn’s disease. 

Figure 1 | Bacterial attachment to the intestinal surface shapes TH17-cell 
function. a, Segmented filamentous bacteria (SFB; coloured green) attach 
tightly to the epithelial cells that line the outer surface of the intestine in mice. 
b, Atarashi et al.¹ provide evidence that this tight attachment of SFB to the 
intestinal epithelium shapes the function of TH17 cells of the immune system 
by inducing epithelial-cell expression of the protein serum amyloid A (SAA). 

Working together, the two research groups 
behind the new papers previously discovered 
that SFB promote a marked accumulation of 
IL-17-secreting TH17 cells in the intestinal 
tissues. This accumulation correlated with 
increased resistance to bacterial infection as 
well as a heightened susceptibility to autoimmune 
disease. But the mystery remained 
of how a non-invasive bacterial species alters 
T-cell development on the other side of the 
epithelial barrier. 

Atarashi et al. provide compelling evidence 
that the physical interaction of SFB with the 
gut epithelium is essential to spur TH17-cell 
development. The authors first noticed that 
SFB have a remarkable fidelity towards their 
animal host. When SFB from either mice or 
rats were introduced into microbiologically 
sterile (germ-free) mice, only the mouse SFB 
attached to the gut surface and induced TH17-cell 
accumulation. But when the same bacteria 
were introduced into germ-free rats, it was the 
rat SFB that adhered tightly to the host and 
induced a TH17 response. This suggested that 
adhesion of SFB to intestinal epithelial cells is 
host-specific, and that it correlates with TH17-cell 
accumulation. 

To nail down whether this was a causal 
relationship, the authors studied two other 
organisms that attach to the epithelium and 
induce TH17-cell accumulation: Citrobacter 
rodentium and Escherichia coli. Unlike SFB, 
the genomes of these bacteria can be altered in 
the laboratory, and so Atarashi and colleagues 
deleted the bacterial genes that are required for 
epithelial attachment. They found that there 
were fewer TH17 cells in the guts of mice colonized 
with the defective bacteria, suggesting 
that bacterial attachment causes TH17-cell 
accumulation. 

So how does bacterial attachment trigger the 
development of cells that lurk in subepithelial 
tissues? Both Atarashi et al. and Sano et al. 
analysed epithelial cells with and without 
attached SFB to see how they differed. Among 
other things, they found that epithelial cells 
with attached SFB made more of a protein 
called serum amyloid A (SAA), expression of 
which is induced during infection or injury. 

The researchers had previously noticed that 
SFB induce SAA expression, and experiments 
with cultured T cells had suggested that SAA 
might coax T cells to differentiate into TH17 
cells. In their present work, Sano et al. extend 
this observation to show that SAA actually 
induces committed TH17 cells to produce 
more IL-17, indicating that SAA modifies the 
effector functions of these cells rather than 
their commitment to a particular lineage. The 
authors support this idea by studying mice 
engineered to lack SAA: although these mice 
have normal numbers of committed TH17 cells, 
the cells make less IL-17 than cells from wild-type 
mice. 

Sano et al.² show that SAA spurs production of the cell-signalling molecule IL-17 
by differentiated TH17 cells. The authors also unravel a regulatory circuit, 
involving other subepithelial immune cells, that governs epithelial Saa 
expression in this context. The circuit includes IL-23 (probably produced by 
dendritic cells in response to bacterial signals), which stimulates innate lymphoid 
cells to produce IL-22. IL-22 then induces Saa expression in epithelial cells. 


Sano et al. also unravel details about the 
molecular circuitry that regulates SAA pro- 
duction. They show that SFB bolster SAA 
expression through a roundabout route involv- 
ing innate lymphoid cells (ILCs), a group of 
immune cells that are related to T cells but 
distinct in that they lack receptors for specific 
antigen molecules. The authors found that SFB 
prompt ILCs to produce IL-22, which in turn 
elicits SAA production by gut epithelial cells 
(Fig. 1b). Interestingly, a similar regulatory 
circuit governs the expression of key anti- 
microbial proteins that are lobbed like hand 
grenades from the outward-facing epithelial 
surface to minimize the attachment of both 
foreign and resident bacteria. Sano and colleagues’ 
findings suggest that this circuit not 
only regulates the response to bacterial attach- 
ment, but also readies subepithelial immune 
cells to deal with potential invaders. 

The findings raise several fascinating 
questions. For example, how do epithelial cells 
sense a physical interaction with bacteria? 
Atarashi et al. provide tantalizing preliminary 
evidence that this might involve SFB triggering 
rearrangements of actin (a structural protein) 
that alter transcription of the Saa gene. And 
how does SAA activate IL-17 expression? SAA 
may act on TH17 cells through a cytokine-like 
mechanism. An alternative possibility stems 
from the finding that SAA binds to retinol 
(vitamin A), which suggests that SAA could 
influence immune-cell development by 
delivering retinol to the cells. 

A key motivation behind research on gut 
bacteria is to harness their functional properties 
to improve human health. The present studies 
suggest the possibility of using tightly adherent 
but non-invasive bacteria to deliberately nudge 
the human intestinal immune system towards 
a more activated state. Such an approach could 
be particularly useful in preventing or treat- 
ing infectious disease. Although SFB do not 
colonize humans, Atarashi et al. have taken a 
step in this therapeutic direction by identify- 
ing a group of 20 human-gut bacteria that, like 
SFB, adhere to the intestinal surface and induce 
TH17-cell accumulation in mice. These findings 
should inspire further investigation into how 
adherent non-invasive bacteria influence the 
human immune system. 
Sometimes γ-rays 
come in twos 


Breakthrough measurements of a rare decay process in an excited barium 
nucleus pave the way for the development of techniques that probe the structure 
and decay modes of atomic nuclei. SEE LETTER P.406 
The excited quantum states of atomic 
nuclei predominantly decay through 
the emission of short-wavelength, 
energetic electromagnetic radiation known 
as γ-rays: an excited nucleus emits a single 
γ-ray photon to reach a lower energy state. 
The energy radiated is equal to the difference 
in the energies of the initial and final quantum 
states. But on page 406 of this issue, Walz 
et al.¹ report the first observation of a rare 
decay mode, for an excited quantum state of a 
barium nucleus (¹³⁷Ba), that proceeds through 
the simultaneous emission of two γ-ray 
photons. Because this quantum state of ¹³⁷Ba 
usually decays through the much more common 
emission of a single γ-ray photon, Walz 
and colleagues’ laboratory measurements 
represent a formidable exercise in finding a 
needle in a haystack, and have relevance to 
the theories of quantum electrodynamics and 
nuclear structure. 

The γγ-decay was first described by the 
physicist Maria Göppert-Mayer on the cusp 
of the 1930s. In her doctoral thesis, she predicted 
that this rare electromagnetic decay 
mode is allowed in the framework of quantum 
electrodynamics: the energy of each of the two 
emitted γ-ray photons can take a value from 
a continuous energy distribution, so that the 
total energy equals the energy liberated as the 
nucleus undergoes the quantum transition 
from the initial to a final state. 

It was not until the mid-1980s that this 
rare process was verified experimentally 
for nuclei. These pioneering measurements 
involved observations of the γγ-decay modes 
of excited states of calcium (⁴⁰Ca; ref. 4), 
zirconium (⁹⁰Zr) and oxygen (¹⁶O) nuclei, 
and required a clever trick. Both the ground 
and the first excited states of these nuclei have 
zero total angular momentum. The fact that 
a γ-ray photon (or any photon) cannot have 
zero angular momentum means that the decay 
of one of these excited states through single 
γ-ray emission is strictly forbidden because 
of the requirements of the conservation of 
angular momentum. This realization allowed 
researchers to eliminate the usually dominant 
single γ-decay mode from consideration, and 
left processes that are experimentally easy to 
identify (such as conversion electron emission 
and internal pair formation) as the only 
other competing decay modes. 

Although γγ emission from nuclei has 
so far been a rare beast, the corresponding 
two-photon emission following electronic transitions 
in atoms was measured and studied several 
decades ago. This is because the atomic process 
has advantageous energetics: the lower-energy 
photons (such as X-rays) produced are 
more easily absorbed by one detector than are 
nuclear γ-rays, which more frequently scatter 
from one detector into another. Atomic 
physicists were also aided by the experimental 
tools to hand. For example, lasers can be used 
to ‘pump’ an atom into an excited atomic state 
that is a candidate for two-photon decay or that 
decays to such a state. 

As early as the 1940s, the physicists Gregory 
Breit and Edward Teller conjectured that two-photon 
emission is the most probable radiative-decay 
mode of the metastable 2s₁/₂ state 
of atomic hydrogen, for which single-photon 
emission is forbidden by angular-momentum-conservation 
arguments. This turned out to 
have astrophysical consequences: the two-photon 
decay of hydrogen and hydrogen-like 
atomic systems contributes to the strength of 
the continuum spectra of planetary nebulae at 
ultraviolet wavelengths. The reverse process, 
the simultaneous absorption of two photons 
during the excitation of an atom or a molecule, 
has even broader impact: it forms the basis 
of two-photon laser-scanning fluorescence 
microscopy, which is used in the biomedical 
fields and in tissue engineering. 

Now, some 30 years after the pioneering 
measurements of γγ-decay in transitions 
of ¹⁶O, ⁴⁰Ca and ⁹⁰Zr nuclei, Walz and 
co-workers have detected, for the first time, 
the γγ-decay of a nuclear quantum transition 
that competes with the allowed single 
γ-decay. The authors faced two experimental 
challenges: first, the former process is at least 
100,000 times less probable than the latter one. 
Second, the γ-ray emitted in single γ-decay 
easily scatters off electrons in the detector 
material (a process known as Compton 
scattering), so that some of its energy is deposited 
in one detector and the rest in another, in 
a way that mimics the signature of the elusive 
γγ emission. 

In an experimental tour de force, Walz et al. 
used an established γ-ray calibration standard, 
the ¹³⁷Ba nucleus in an excited state that is 
created when a caesium nucleus (¹³⁷Cs) undergoes 
a process known as β-decay. When the 
excited state of ¹³⁷Ba decays, it predominantly 
emits a 662-kiloelectronvolt γ-ray. To catch 
the rare, simultaneous γγ events, the authors 
surrounded the γ-ray source with five scintillation 
detectors (see Fig. 1a of the paper¹) 
that could register γ-rays at a time resolution 
better than 1 nanosecond. This fine resolution 
crucially helped the authors to reject the 
impostor, scattered single-γ emissions, which 
have different flight times from the true simultaneous 
γγ events. Furthermore, the authors 
used massive lead shielding to minimize 
the scattering of γ-rays from one detector 
into another. 
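
The selection logic described here can be illustrated with a minimal Python sketch. It is not the authors' analysis code: the event format, the name `select_gamma_gamma_candidates` and the energy tolerance are hypothetical, chosen only for illustration. The figures taken from the text are the 662-keV transition energy and the timing resolution of about 1 nanosecond, which is assumed here to set the coincidence window.

```python
from dataclasses import dataclass
from itertools import combinations

TRANSITION_ENERGY_KEV = 662.0  # total transition energy (from the text)
TIME_WINDOW_NS = 1.0           # assumed coincidence window (~1 ns resolution)
ENERGY_TOLERANCE_KEV = 20.0    # hypothetical tolerance on the summed energy


@dataclass(frozen=True)
class Hit:
    detector: int      # index of the detector that fired
    time_ns: float     # arrival time of the signal
    energy_kev: float  # energy deposited in that detector


def select_gamma_gamma_candidates(hits):
    """Return pairs of hits compatible with a simultaneous two-photon decay."""
    candidates = []
    for first, second in combinations(hits, 2):
        if first.detector == second.detector:
            continue  # a pair must involve two different detectors
        # True two-photon events arrive together; a photon that scatters
        # from one detector into another reaches the second one later.
        simultaneous = abs(first.time_ns - second.time_ns) <= TIME_WINDOW_NS
        total = first.energy_kev + second.energy_kev
        full_energy = abs(total - TRANSITION_ENERGY_KEV) <= ENERGY_TOLERANCE_KEV
        if simultaneous and full_energy:
            candidates.append((first, second))
    return candidates


# Illustrative hits: two photons sharing 662 keV, plus one late full-energy hit.
example = [Hit(0, 0.0, 300.0), Hit(1, 0.2, 362.0), Hit(2, 3.5, 662.0)]
print(len(select_gamma_gamma_candidates(example)))
```

In this toy version the energy cut alone cannot separate the two cases, because a scattered single photon also deposits 662 keV in total; it is the timing cut that does the rejecting, which mirrors the role the text assigns to the fine time resolution.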

After more than 50 days of measurements, 
Walz et al. reported a clear signal at the 
expected total energy (662 keV). It was registered 
simultaneously by detector pairs from 
various angles, thus yielding the first positive 
signal of γγ-decay in competition with single-γ 
emission. The authors determined that the 
odds of the former decay occurring were only 
1 in 487,805, underlining the accomplishment 
in isolating such a feeble signal from a multitude 
of much more probable processes. 
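
To put the quoted odds on a more familiar scale, the short sketch below converts "1 in 487,805" into a probability and checks it against the "at least 100,000 times less probable" figure given earlier. The variable names are illustrative; only the two numbers come from the text.

```python
ODDS_AGAINST = 487_805       # "1 in 487,805", from the text
MINIMUM_SUPPRESSION = 100_000  # "at least 100,000 times less probable"

# Probability of the two-photon decay implied by the quoted odds.
branching_ratio = 1 / ODDS_AGAINST
print(f"{branching_ratio:.3e}")  # 2.050e-06

# The measured suppression is consistent with the stated lower limit.
print(ODDS_AGAINST >= MINIMUM_SUPPRESSION)  # True
```

In other words, roughly two decays in every million proceed through the two-photon route.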

Other groups have been trying for several 
years to detect and characterize the γγ-decay of 
¹³⁷Ba through experimental and theoretical 
work, but Walz and co-workers have made 
the first breakthrough. One of the remaining 
challenges is to measure precisely the energy 
distributions of the two emitted γ-rays at various 
emission angles. 

Although Walz and colleagues’ measurement 
provides a rare confirmation of an elusive 
nuclear process, γγ emission itself could 
be used to probe the structure of atomic nuclei. 
The γγ-decay of a nucleus occurs through a 
multitude of virtual intermediate nuclear 
states. The corresponding transitions that 
occur as a nucleus passes from its initial state 
to a virtual state, and from there to the final 
ground state, can be calculated by models 
that aim to describe the structure of nuclei. 
The characteristics of γγ emission can therefore 
provide a tracer of the combined probability 
that a nucleus can transition through 
the multitude of possible pathways; these 
can never be fully measured in experiments 
that attempt to excite all individual 
intermediate states. 

The previously measured probabilities of γγ 
transition between two states that each have 
zero total angular momentum are related to 
gross properties of the nucleus, such as electric 
polarizability and magnetic susceptibility. In 
the case of the more-complex decay process 
studied by Walz et al., developing a comprehensive 
theoretical model is a challenge. Walz 
and colleagues’ data may help by acting as a 
benchmark against which theoretical nuclear 
wavefunctions of ¹³⁷Ba can be compared. 


Deep brain stimulation 
for Rett syndrome 


Mutations in the gene MECP2 cause an intellectual-disability disorder called 
Rett syndrome. In a mouse model, electrical stimulation of deep brain regions 
is found to ameliorate some of the features of the syndrome. SEE LETTER P.430 


STUART R. COBB 


Childhood intellectual-disability 
disorders typically have a genetic 
cause and are notoriously resistant to 
treatment. In particular, there are almost no 
therapies that can alleviate deficits in core 
cognitive features such as learning, memory, 
concentration and communication. So far, the 
search for treatments has been dominated by 
drug-based and, to a lesser extent, gene-based 
therapies (see ref. 2, for example). However, 
on page 430 of this issue, Hao et al.³ provide 
evidence that electrical stimulation of a deep 
brain region can effectively reverse impaired 
learning in a mouse model of Rett syndrome, 
one of the leading causes of intellectual disability 
in girls. 

Deep brain stimulation (DBS) has been 
used to treat various brain disorders, including 
Parkinson's disease, chronic pain, obsessive-compulsive 
disorder and clinical depression. 
Its use is well established in movement disorders 
that are otherwise resistant to treatment, 
but it remains highly experimental for most 
other neurological and psychiatric conditions, 
and has rarely been used in children. Using 
DBS entails implanting thin wire electrodes in 
the brain, to a depth and location determined 
by the symptoms being treated. An electrical 
device then delivers controlled impulses to 
stimulate local brain activity in a regulated 
fashion. However, the precise mechanism by 
which DBS works remains largely unknown. 
Experimental evidence suggests that DBS 
may improve cognition in adult rats with 
memory impairment, and can slow the rate 
of cognitive decline in people with Alzheimer’s 
disease. But Hao and colleagues’ report marks 
the first demonstration that DBS might have 
the potential to tackle a childhood intellectual-disability 
disorder. The authors focused on Rett 
syndrome, which lies towards the more severe 
end of the intellectual-disability spectrum and 
is almost always caused by mutations in the 
MECP2 gene. Mice that have mutations in this 
gene show intellectual disability and mimic 
other key features of Rett syndrome, including 
impairment of motor skills and breathing 
control, and other neurological effects. 

Figure 1 | Stimulating learning. a, Hao et al.³ treated mice that model 
Rett syndrome, an intellectual-disability disorder, with a procedure 
known as deep brain stimulation (DBS). The authors implanted 
electrodes into a nerve-fibre bundle in the brain called the fimbria–fornix, 
which connects the hippocampal regions responsible for some forms of 
memory, both with each other and with deeper brain structures. They then 
stimulated this region electrically. b, Using DBS enhanced hippocampus-dependent 
learning and memory in Rett-syndrome mice, enabling treated 
mice to find a hidden platform more efficiently than those that did not 
receive DBS. 

Hao et al. targeted DBS to the fimbria–fornix 
(Fig. 1), an intersection of nerve fibres 
that connects the hippocampus regions in each 
brain hemisphere (which are involved in certain 
forms of memory), both with each other 
and with other brain structures. The authors 
stimulated the fimbria–fornix of mice for two 
weeks and performed sensitive behavioural 
tests three weeks later. In normal mice that 
received DBS, there was a modest improve- 
ment compared with unstimulated mice in 
forms of learning and memory that are depend- 
ent on hippocampal function. Crucially, how- 
ever, Rett-syndrome mice showed a dramatic 
improvement in hippocampus-dependent 
spatial learning and contextual fear memory. 
For both of these cognitive functions, behav- 
ioural performance in Rett-syndrome mice was 
restored to levels indistinguishable from those 
seen in normal, unstimulated animals. 

These exciting findings could have major 
implications, but caution is required when 
extrapolating from cognitive tests in mice to 
the treatment of humans with intellectual dis- 
ability. The tests used in animal models tend to 
be very specific, and it is unclear to what extent 
cognitive functions measured in rodents, such 
as the ability to perform in a maze test, correspond 
to the cognitive domains that need treating in 
people. Moreover, behavioural tests in rodents 
can be poor predictors of human responses. 
This is particularly true for tests of drugs that 
enhance cognition — there is a high attrition 
rate for these drugs, because many that have 
shown promise in cognitive tests in animals 
lacked efficacy in human clinical trials. 

The authors’ results indicate that the effects 
of fimbria–fornix DBS are restricted to specific 
cognitive domains, with no improvement in 
other Rett-syndrome-like symptoms, such as 
altered anxiety, pain sensitivity or motor con- 
trol. This is perhaps not surprising, given the 
brain region targeted by Hao and colleagues. 
However, DBS has been effective in lessening 
these other defects, especially in motor disor- 
ders. So it is possible that altering the position 
of electrodes may ameliorate other symptoms 
of Rett syndrome. 

The next steps are to analyse the mechanism 
by which DBS improves learning in Rett- 
syndrome mice, and to determine whether 
similar effects can be expected in people with 
Rett syndrome or other forms of intellectual 
disability. The main way in which memory 
traces are thought to be encoded and stored is 
through changes in the strength of the synaptic 
connections between neurons, and disruption 
of this synaptic plasticity is a hallmark of many 
neurodevelopmental disorders. Hao et al. show 
that long-term potentiation, a particular 
type of synaptic plasticity that is required for 
certain forms of learning, is impaired in 
the hippocampus of Rett-syndrome mice but 
is boosted by DBS. Furthermore, DBS also 
enhanced the generation of neurons from stem 
cells that are resident in the hippocampus. 
Whether these effects are simply biomarkers 
of stimulation in hippocampal neural cir- 
cuits, or whether they truly contribute to the 
procognitive action of DBS, awaits further 
experimentation. 

Hao and colleagues’ study is meaningful 
because finding effective treatments for the 
core symptoms of childhood intellectual- 
disability disorders is one of the great unmet 
medical challenges in contemporary neuro- 
science. Genetic studies indicate that a multi- 
tude of molecular abnormalities can cause 
intellectual disability, which probably explains 
the prevalence of these disorders and makes 
finding treatments so difficult — it is hard to 
envisage a drug therapy that would be effective 
for many forms of intellectual disability. 

Although DBS is invasive, it is considered to 
be safe and controllable. Whether or not DBS 
can be widely applied to intellectual-disability 
disorders, and whether it may one day constitute 
a common treatment option, remains to 
be seen. Nonetheless, the current study will 
doubtless provide impetus for future studies 
in this direction. 
Microscopic friction 
emulators 


Cold ions sliding across periodic energy-potential patterns formed by lasers have 
been used to elucidate the physics of dry friction between crystals. Experiments 
with no more than six ions suffice to explore a vast domain of frictional forces. 


DAVIDE MANDELLI & ERIO TOSATTI 


The study of friction is a centuries-old, 
but still vibrant, subject. The archetypal 
friction problem of dry sliding 
between solids concerns the forces that resist 
the relative lateral motion of crystal surfaces 
in contact. In this case, the periodic spacing of 
the atoms in the crystals eliminates the complexity 
introduced by the ill-defined nature of 
ordinary, non-crystalline surfaces. Studies of 
nanometre-scale systems through tip-based 
instruments, non-equilibrium theory, computer 
simulations and, most recently, the use 
of artificial friction emulators have revived 
this topic. Writing in Science and in Nature 
Physics, respectively, Bylinskii et al.² and 
Gangloff et al.³ follow up earlier theoretical 
suggestions, and present friction emulators 
formed from trapped ions that slide over 
optical lattices, which are periodic energy potentials 
created by the interference of laser beams. 
Although dry sliding is a well-defined problem, 
sufficient inroads have not been made 
into it. This is because suitable experimental 
systems are scarce, but also because the theory 
of frictional dynamics is mostly restricted to 
computer simulations, which are vivid, but 
incomplete, representations of the complex 
phenomenon of friction between real solid 
interfaces. Much of scientists’ understanding 
of friction relies on apparently trivial, one-dimensional 
periodic potential models, such 
as the Prandtl–Tomlinson (PT) single-slider 
and the Frenkel–Kontorova (FK) sliding-chain 
descriptions. 

The PT model illustrates the switch from 
stick-slip friction — for example, that aris- 
ing from the intermittent motion of chalk on 
a blackboard — to smooth sliding as surface 
corrugation decreases. The same model also 
describes the passage from large friction at 
high sliding speeds to vanishingly small friction 
(thermolubricity) at very low speeds. In 
the special case of mismatched lattices, the FK 
model describes the transition between friction- 
less (superlubric) motion and pinned frictional 
sliding (a regime in which it takes a large force to 
dislodge the lattice and nudge it forward). 
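
The switch from stick-slip to smooth sliding in the PT model can be made concrete with a small sketch. It assumes the standard textbook form of the model, which the article does not write out: a point slider in a sinusoidal potential of peak-to-peak amplitude U0 and period a, dragged through a spring of stiffness k. In that form, the dimensionless corrugation parameter η = 2π²U0/(ka²) separates smooth sliding (η < 1) from stick-slip (η > 1) at zero temperature and slow driving. The function names and the numerical values are illustrative.

```python
import math


def corrugation_parameter(u0, k, a):
    """Dimensionless PT parameter eta = 2*pi^2*U0 / (k*a^2).

    u0: peak-to-peak amplitude of the periodic potential
    k:  stiffness of the spring that drags the slider
    a:  period of the potential
    """
    return 2.0 * math.pi**2 * u0 / (k * a**2)


def sliding_regime(u0, k, a):
    """Classify the zero-temperature, slow-driving regime of the PT model."""
    eta = corrugation_parameter(u0, k, a)
    # For eta > 1 the total energy has more than one minimum for some
    # support positions, so the slider sticks and then jumps (stick-slip).
    return "stick-slip" if eta > 1.0 else "smooth sliding"


print(sliding_regime(u0=1.0, k=1.0, a=1.0))   # strong corrugation
print(sliding_regime(u0=0.01, k=1.0, a=1.0))  # weak corrugation
```

Lowering the corrugation amplitude, or stiffening the spring, moves the slider from the first regime to the second, which is the switch the PT model is said to illustrate.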

What these models have achieved is a 
description of how properties such as corrugation, 
temperature, velocity and lattice-matching 
of materials in contact might influence and 
determine friction. In principle, the understanding 
gained from the models could allow 
researchers to control frictional forces, something 
desirable in many practical settings. 
However, despite the firm theoretical background, 
neither of these models has been 
tested experimentally. That is what emulators 
are meant to do. 

Figure 1 | Ion chain trapped in a lattice. Bylinskii et al.² and Gangloff et al.³ investigate the friction 
that arises when one or a few ions are forced to slide across optical lattices in which a corrugated energy 
potential is created by counter-propagating lasers. These simple experiments have provided benchmark 
tests of long-standing frictional models for sliding crystal surfaces. (Adapted from ref. 2.) 

The emulators of Bylinskii et al. and 
Gangloff et al. are based on short chains of 
trapped ionized atoms that, under the influ- 
ence of an electric field, are forced to slide 
across a laser-generated optical lattice (Fig. 1). 
These techniques may seem arcane, yet they 
are accurate and powerful, because param- 
eters such as the temperature, atom velocity 
and spacing, chain length and the amplitude 
of the lattice’s potential can be flexibly adjusted 
across a vast range of values. 

In this vein, Gangloff and colleagues 
present experiments involving one or two 
ions that slide across an optical lattice, a 
set-up that emulates the PT model to near 
perfection. In their single-ion set-up, the 
authors demonstrate that, even at micro- 
kelvin temperatures, there is a speed below 
which the friction between the sliding ion 
and the lattice vanishes — in agreement with 
thermodynamics. For higher speeds, stick-slip 
friction ensues and rises by more than a fac- 
tor of 100 with increasing speed. No experi- 
ment involving real crystals can span this 
range of friction and speed. Although many 
of the authors’ findings had been known 
from numerical simulations, their emulator 
is superior to those; for example, it reproduces 
the theoretically expected dependence of 
friction on velocity much more accurately. 

To adequately emulate the FK model, 
one would need an infinitely long ion chain 
instead of a single ion. All that Bylinskii et al. 
use are short chains of two to six ions sliding 
across an optical lattice — so is this just a 
baby step towards that goal? Not quite. By 
adjusting the distances between the ions, the 
authors tune the amount of mismatch between 
the chain and the corrugation of the lattice’s 
periodic potential, and produce a dramatic 
effect on the friction that they measure. 

Starting from large friction for per- 
fect chain-lattice matching, Bylinskii and 
colleagues observe a rapid decrease in friction 
as they increase a parameter that controls the 
amount of chain-lattice mismatch. This trend 
reflects the evolution from strong pinning 
friction towards lubricity or even superlubri- 
city. Superlubricity, however, is reached only 
when the intensity of the lattice’s potential falls 
below the value that demarcates the transition 
(known as the Aubry transition) between 
frictionless and pinned frictional sliding. 

Studies of the Aubry transition are of inter- 
est, as has been suggested by theoretical stud- 
ies of systems with long ion chains. It will 
be even more interesting to study this transi- 
tion in short-chain systems such as those that 
Bylinskii et al. and Gangloff et al. use. Overall, 
it is surprising how much can be learnt about 
the physics of infinitely long systems from 
studies that involve just a few ions. 

As always, experiments teach us more than 
we anticipate. Clear-cut techniques, such as 
the cold-ion emulators reported in these two 
papers, provide insights into the complexity 
that underlies even the simplest act of friction 
involving a handful of ions. The physicist 
Philip Warren Anderson once said, “more is 
different”. But in the case of the current papers, 
one could counter that dictum by saying that, 
sometimes, less can be different after all. 

Visible inequality 
breeds more inequality 


Experiments suggest that when people can see wealth inequality in their social 
network, this propels further inequality through reduced cooperation and 
reduced social connectivity. SEE LETTER P.426 


SIMON GÄCHTER 


Inequality is a growing concern in many 
societies. Like most important social 
phenomena, it is a complex issue that 
has many interacting sources and conse- 
quences. To understand inequality and 
its dynamics over time, multiple theoreti- 
cal and empirical approaches are necessary. 
In this issue, Nishi et al. (page 426) use 
laboratory-style experiments (conducted 
online) to study how the visibility of wealth 
inequality in people’s social environment 
shapes the behavioural dynamics of inequal- 
ity. The attraction of an experimental approach 
is that it allows the control of factors that 
are inherently uncontrollable in naturally 
occurring data. Crucially, for example, the 
experimenter can control the initial level of 
inequality and see how inequality evolves as a 
function of people's behaviour alone. 

Nishi and colleagues’ experimental model 
used an assessment of people’s willingness 
to contribute to public goods to test how 
initial wealth inequality and the structure 
of the social network influence the evolu- 
tion of inequality. The researchers were par- 
ticularly interested in the role of visibility 
of wealth — can mere observation of your 
neighbour's wealth lead to more inequality 
over time, even if such information does not 
change economic incentives? Visible wealth 
might have a psychological effect by trigger- 
ing social comparisons and thereby influenc- 
ing economic choices that have repercussions 
for inequality. 

In their online laboratory, the researchers 
endowed all participants with tokens, worth 
real money. The endowment differed across 
individuals and treatments: in a treatment 
without inequality, all participants initially 
received the same number of tokens; in a low- 
inequality treatment, participants had similar 
but different initial endowments; and in the 
high-inequality treatment there was a substan- 
tial starting difference between participants. 

The groups typically comprised 17 people 
arranged at random in a social network in 
which, on average, about 5 people were linked 
(‘neighbours’). In each of the 10 rounds of the 
following game, participants had to decide 
whether to behave pro-socially (‘cooperate’) 
by reducing their own wealth by 50 tokens 
per connected neighbour to benefit each of 
them by 100 tokens, or to behave pro-selfishly 
(‘defect’) by keeping their tokens for them- 
selves. These decisions had consequences 
for accumulated wealth levels and inequality. 
At the end of each round, the subjects learnt 
whether their neighbours had cooperated or 
defected and 30% of participants were given 
the opportunity to change their neighbour, 
that is, to either sever an existing link or to 
create a new one. 
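
The pay-off rule described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function and variable names, the three-player example network and the use of the Gini coefficient as the inequality measure are all assumptions made here. Only the 50-token cost and the 100-token benefit per connected neighbour come from the text, and the rewiring step is left out.

```python
from typing import Dict, Iterable, Set

COST_PER_NEIGHBOUR = 50      # tokens a cooperator gives up per connected neighbour
BENEFIT_PER_NEIGHBOUR = 100  # tokens each neighbour of a cooperator receives


def play_round(
    wealth: Dict[str, int],
    neighbours: Dict[str, Set[str]],
    cooperates: Dict[str, bool],
) -> Dict[str, int]:
    """Return every player's wealth after one round of the game."""
    new_wealth = dict(wealth)
    for player, does_cooperate in cooperates.items():
        if not does_cooperate:
            continue  # defectors keep their tokens
        for other in neighbours[player]:
            new_wealth[player] -= COST_PER_NEIGHBOUR
            new_wealth[other] += BENEFIT_PER_NEIGHBOUR
    return new_wealth


def gini(values: Iterable[float]) -> float:
    """Gini coefficient of non-negative values (0 means perfect equality).

    Assumed here as the inequality measure; the text does not name one.
    """
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum((rank + 1) * x for rank, x in enumerate(xs))
    return (2 * weighted) / (n * total) - (n + 1) / n


# Hypothetical three-player chain A - B - C with equal starting wealth.
start = {"A": 500, "B": 500, "C": 500}
links = {"A": {"B"}, "B": {"A", "C"}, "C": {"B"}}
choices = {"A": True, "B": False, "C": True}  # B free-rides on A and C
after = play_round(start, links, choices)
```

In this toy example the two cooperators each end the round poorer than the defector between them, so a group that starts out equal becomes unequal after a single round.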

A crucial manipulation in this experiment 
was wealth visibility. Under invisible condi- 
tions, the participants could observe only their 
own accumulated wealth. Under visibility, they 
could see the accumulated wealth of their con- 
nected neighbours but not the whole network. 
Thus, there were six conditions in total: three 
levels of initial wealth inequality in each of the 
two visibility conditions. 

The results are complex but illuminat- 
ing. The authors find that, under high initial 
wealth inequality, visibility of neighbours’ accu- 
mulated wealth increases inequality over time 
relative to the invisibility condition, although 
absolute inequality decreases over time under 
both visibility conditions. The reason for the 
relative increase under visibility is that inequal- 
ity drops only moderately, whereas under invis- 
ibility the reduction in inequality is substantial. 
By contrast, in the case of initial wealth equal- 
ity, inequality increases — similarly in both 
visibility conditions. Under moderate initial 
inequality, visibility leads to a small increase 
in inequality relative to invisibility. 

Figure 1 | Wealth on display. Nishi et al. use an 
experimental game to show that, when people can 
see the wealth of others whom they are linked with 
in a social network, inequality increases and the 
number of social connections decreases. 

Visibility of wealth also leads to lower social 
welfare, as measured by overall wealth (Fig. 1). 
By the end of the experiment, total accumu- 
lated wealth was substantially larger in the 
three conditions with invisible wealth than in 
the three conditions with visible wealth. The 
reason for this is that cooperativeness was 
lower under the condition of visible wealth 
compared to invisible wealth, and there were 
fewer links in the social network. 

The most striking insight from these find- 
ings is the effect of wealth visibility on the 
dynamics of inequality: conspicuous inequal- 
ity breeds more inequality. Although visibility 
of wealth does not change economic incentives 
in this experimental scenario, it invites social 
comparisons that, for various reasons worth 
exploring further, undermine cooperation and 
diminish social ties. This observation adds to 
existing, but sparse, evidence that public 
information about individual pay-offs leads 
to more competition, which in a public-goods 
setting triggers more ‘free-riding’ by indi- 
viduals (defecting when others cooperate), to 
improve their own pay-offs. 

Nishi and colleagues’ findings raise several 
intriguing methodological questions for future 
studies. For example, how much influence does 
the social network and its rewiring have on the 
main results of this experiment? Modelling 
interactions using a social network is certainly 
realistic, but is it crucial for the emergence of 
visibility effects in inequality? Another ques- 
tion concerns the result that visibility of wealth 
matters much less under initial equality of 
wealth. This is surprising, given that inequal- 
ity of wealth increases over time and visibility 
effects should kick in, according to the results 
from the treatments with initial inequality. It 
is possible that these experiments, which used 
only ten iterations, might have been too short 
to allow for visibility effects arising as inequal- 
ity grows. 

The results also suggest substantive questions 
worthy of further research. As well as under- 
standing the role of visibility of wealth (or 
pay-offs more generally) for cooperation, it 
would be interesting to gather evidence about 
how people’s pro-social attitudes are affected 
by the ever-increasing amount of information 
about other people’s consumption (as a signal 
of their wealth), which nowadays is spread 
on an almost global scale by social media. And 
how do visibility and social comparisons affect 
the dynamics of inequality when the relevant 
game is not one of cooperation but of com- 
petition? This is interesting because, in many 
interactions in our modern societies, not only 
initial endowments (wealth) matter but also 
resources that are allocated as people compete 
for scarce rewards — good jobs, for instance. 

These are just some questions that can be 
investigated with the experimental model 
put forward by Nishi and colleagues. Their 
most general contribution is to showcase the 
power of experiments to contribute to our 
understanding of the behavioural dynamics 
of inequality. 

discovery in human disease. Although the interpretation 
and definition of clinically relevant genetic variation 
remains a challenge, established examples are being 
used to stratify subgroups of patients to identify optimal 
treatments. Precision medicine is emerging as a natural 
extension that integrates research disciplines and clinical 
practice to build a knowledge base that can better guide 
individualized patient care. 

At a crucial time when efforts such as the UK 100,000 
Genomes Project and the US Precision Medicine 
Initiative seek to scale up population-based genome 
sequencing and integrate it with clinical data, we present 
this collection of reviews that assess progress towards 
the implementation of precision medicine. A new 
framework is required to bring together researchers, 
clinical laboratories, clinicians and patients in what 
Samuel Aronson and Heidi Rehm term a “precision- 
medicine ecosystem”. 

Tailoring treatment to the patient is central: for this, 
Mary Relling and William Evans examine progress 
in pharmacogenomics and Luigi Naldini describes 
developments in gene therapy. Drug-development 
pipelines are benefiting from genomics in target- 
validation approaches that promise improved success 
rates and reduced costs, whereas clinical trials need to be 
redesigned to match the right trial to the right patient, as 
proposed by Andrew Biankin and colleagues. 

For the next steps, the need to share data is greater than 
ever. The Global Alliance for Genomics and Health offers 
innovative solutions that facilitate research and clinical 
diagnoses while maintaining the privacy of sensitive 
data. Also essential will be a continually updated 
knowledge base to aid the interpretation of genetic tests. 
Although the challenges might seem formidable, we are 
encouraged by the early examples discussed here and are 
optimistic for the realization of precision medicine. 

Nature is pleased to acknowledge the financial support 
of AstraZeneca in producing this Insight. As always, 
Nature carries sole responsibility for all editorial content. 

Building the foundation for 
genomics in precision medicine 


Samuel J. Aronson & Heidi L. Rehm 


Precision medicine has the potential to profoundly improve the practice of medicine. However, the advances required will 
take time to implement. Genetics is already being used to direct clinical decision-making and its contribution is likely 
to increase. To accelerate these advances, fundamental changes are needed in the infrastructure and mechanisms for 
data collection, storage and sharing. This will create a continuously learning health-care system with seamless cycling 
between clinical care and research. Patients must be educated about the benefits of sharing data. The building blocks for 
such a system are already forming and they will accelerate the adoption of precision medicine. 


The practice of medicine is an inexact science. The clinician 
assesses the patient’s symptoms and decides which tests to per- 
form to gather more data. They must determine the cause of 
the symptoms and the patient’s prognosis, whether clinical interven- 
tion is warranted and, if so, which intervention to prescribe. To do this 
effectively, the clinician might need to assess several potential courses 
of action and incorporate all that is known about the patient into his or 
her decision. Human physiology is complex. In some cases, the cause 
of the patient’s symptoms cannot be ascertained. In other cases, clini- 
cians cannot gather enough data to make a fully informed decision. The 
guesswork inherent in the practice of medicine reduces the efficacy of 
the interventions that are prescribed. 

Genetics is an important contributor to this complexity. Distinct 
genetic variants cause conditions that respond to different treatments 
yet share a similar set of symptoms. Without a mechanism to determine 
the underlying genetic cause of a set of symptoms, it might not be pos- 
sible to determine which treatment will be most effective a priori. For 
instance, although there are many causes of lung cancer, only people 
who have an alteration in the gene EGFR respond to treatment with 
tyrosine kinase inhibitors. Similarly, many genetic lesions lead to a 
thickened heart and an increased risk of sudden cardiac death, but only 
people with mutations in the gene GLA respond to enzyme replace- 
ment therapy. Even when the cause of a condition is known, unrelated 
genetic variants can affect treatment efficacy by altering the way in 
which drugs are metabolized or by increasing the likelihood of adverse 
events. For example, patients who are treated with conventional doses 
of the immunosuppressive drug azathioprine for an extended period are 
at risk of developing life-threatening myelosuppression if they harbour 
genetic variants that prevent the drug from being properly metabolized. 
And approximately 6% of European populations carry HLA-B alleles 
that predispose them to potentially life-threatening hypersensitivity 
reactions if they are treated with the antiretroviral drug abacavir. 

Understanding the patient’s genetic make-up is crucial for providing 
optimal care for many diseases. Clinicians now have access to an increas- 
ing array of tests that allow them to determine which genetic variants 
exist in their patients. These include: genotyping tests that look at vari- 
ants in a patient’s DNA sequence that are known to associate strongly 
with important clinical effects; panel-based gene sequencing, which 
looks at many genes related to a specific indication to detect known and 
new variants; sequencing of the exome — all known protein-coding 
genes; and whole-genome analysis that attempts to sequence a patient’s 
genome. However, simply determining which variants are present is 
insufficient. The implications of these variants must also be determined 
for each clinical indication. This genetic understanding must then be 
considered in conjunction with other clinical data to decide the path 
that will produce the best results for the patient. 


The precision-medicine ecosystem 
The goal of precision medicine is to enable clinicians to quickly, effi- 
ciently and accurately predict the most appropriate course of action 
for a patient. To achieve this, clinicians are given tools — in the form 
of tests and information-technology support — that are both compat- 
ible with their clinical workflow and economically feasible to deploy 
in the modern health-care environment. These tools help to simplify 
the process of managing the extreme biological complexity that under- 
lies human disease. To support the creation and refinement of these 
tools, a precision-medicine ‘ecosystem’ is developing. This ecosystem 
is beginning to link clinicians, laboratories, research enterprises and 
clinical-information-system developers together in new ways. There 
is increasing hope that these efforts will create the foundation of a con- 
tinuously learning health-care system that is capable of fundamentally 
accelerating the advance of precision-medicine techniques. 
Interpretation is key to the precision-medicine ecosystem. It occurs 
at several levels. Individual variants can be interpreted in relation to 
specific indications. Sets of variants can be assessed in relation to their 
collective impact on patients. Genetic and clinical data can be com- 
bined to determine the best course of action for a patient. The quality 
of these interpretations is highly dependent on the data on which they 
are based. For this reason, research and clinical databases provide the 
foundation for precision medicine. Continuous learning in health care 
is in many ways driven by improvements to the content and structure 
of these resources. For example, the highly collaborative Clinical and 
Functional Translation of CFTR (CFTR2) project brought together 
researchers from around the world to share extensive patient data on 
CFTR variants to distinguish between pathogenic and benign lesions. 
This work is leading to more effective treatments for patients with cystic 
fibrosis. Similarly, the Evidence-based Network for the Interpretation 
of Germline Mutant Alleles (ENIGMA) consortium has engaged a 
large collaborative network to share data on the BRCA1 and BRCA2 
genes that predispose patients to breast and ovarian cancer. Patients 
with pathogenic variants in these genes can now take preventive action 
through monitoring and prophylactic surgical procedures, and those 
with active breast cancer are candidates for targeted treatments. 
In addition, large collaborative programmes led by the US National 
Institutes of Health (NIH)-supported Clinical Genome (ClinGen) 
Resource and the Global Alliance for Genomics and Health have 
begun to tackle the development of reliable resources for systemati- 
cally defining the pathogenicity of all human variation through broad 
and targeted efforts. 

When optimized, the infrastructure that supports the precision- 
medicine ecosystem efficiently manages and integrates the flow of 
material, knowledge and data needed to generate, validate, store, refine 
and apply clinical interpretations (Fig. 1). Biobanks link samples with 
patient data to support discovery. Research databases record the data, 
calculations and results that provide evidence for clinical interpreta- 
tions. Clinical-knowledge-sharing networks enable the refinement of 
interpretations. Clinical laboratories and their information systems 
facilitate the consolidation of interpretations into reports and alerts. 
Electronic health records (EHRs) and associated systems help clini- 
cians to apply results, both when they are received and as the patient's 
condition and knowledge of the variants evolve. Patient-facing infra- 
structure or ‘portals’ provide individuals with access to their genetic 
data and — if appropriate — the ability to decide how they should be 
used, including whether to participate in research. At present, much 
of this infrastructure is at a very early stage of development. However, 
the infrastructural foundation for precision medicine is beginning to 
emerge. In this Review, we explore its crucial components. 


The patient viewpoint 

The role of the patient in supporting precision medicine is becoming 
increasingly important. Patients are obtaining a growing number of 
genetic results in the course of their care. Typically, clinicians involved 
in their treatment order such tests for them. However, patients are also 
now able to access direct-to-consumer testing, sometimes through the 
help of someone who is not directly involved in their care. To ensure 
that precision medicine is tailored to the unique genetic make-up of 
each patient, we must gather as much information as possible from indi- 
vidual patients. Yet there are risks associated with widespread sharing 
of patient data. To gain access to these data, researchers must actively 
engage patients, teach them about the benefits of data sharing and help 
them to weigh up the risks and benefits. This can be done by making 
the process of obtaining consent more effective. 

There are two major forms of consent that are relevant: consent for 
receiving medical treatment or procedures; and consent for releas- 
ing data or samples for use in research. In both cases, the risks and 
benefits must be conveyed to the patient. However, the conventional 
distinction is that obtaining consent for treatment focuses on benefits 
to the individual whereas obtaining consent for research focuses on 
generalizable knowledge. Increasingly, the line between clinical care 
and research is blurring; participation in research studies can lead to a 
direct improvement in outcome for the patient, and the continuous 
capture of clinical-care data has proved an effective way to inform 
generalizable knowledge. As a result, efforts are under way to ask all 
patients who enter the clinical-care setting to sign a form that permits 
their data to be used in research. In addition, those signing clini- 
cal genetic-testing consent forms now commonly agree to share their 
data broadly to help advance knowledge. Nevertheless, there is still a 
need for more uniform consenting processes. It is difficult to generate 
consent forms in language that is both easy to understand and robustly 
conveys the main issues associated with genetic testing. Sharing such 
language across institutions could be helpful in this context. Harmoniz- 
ing consent language across providers, laboratories and biobanks would 
make it easier to administer and adhere to those agreements. Recently, 
the Regulatory and Ethics Working Group of the Global Alliance for 
Genomics and Health published a framework for the responsible shar- 
ing of genomic and health-related data. The group has also created 
consent tools and policies to aid the development of standardized 
approaches to obtaining consent and that support data sharing in the 
global community. Consistent with the Global Alliance for Genomics 
and Health framework, ClinGen has developed standardized consent- 
ing approaches (http://clinicalgenome.org/data-sharing/) for use in the 
clinical-care setting, which will enable sharing of genetic-test results 
and accompanying phenotypic data in the absence of research-study 
enrolment. 

Figure 1 | The precision-medicine ecosystem. The precision-medicine 
ecosystem contains building blocks that optimally connect patients, clinicians, 
researchers and clinical laboratories to one another. Patients and clinicians 
access information through portals or EHRs. The ecosystem can include 
displays or CDS augmented by curated knowledge that is supplied and shared 
by multiple stakeholders. Case-level databases and biobanks receive case data 
and samples from clinical and research workflows. Researchers benefit from 
all of these information sources and also contribute to knowledge sources. 
Clinical laboratories leverage data and inform the clinical community as they 
assess genomic variation and its impact on human health. 

Some patients are extremely interested in supporting research and 
are willing to take proactive steps to facilitate the sharing of genetic 
information. The Global Network of Personal Genome Projects recruits 
volunteers who are prepared to share their genomic data and medical 
histories publicly. ClinGen manages the GenomeConnect patient por- 
tal, built on the Patient Crossroads platform, which allows individu- 
als to share health and genetic information to form communities. The 
Platform for Engaging Everyone Responsibly (PEER), supported by 
the Genetic Alliance, enables individuals to control sharing, privacy 
and access preferences for their health and genomic data with a high 
degree of precision. 


The clinician viewpoint 

Clinicians gain access to patients’ genetic information through tests. 
Tests have two components: a technical component that focuses on 
identifying which variants are present in the patient; and an interpretive 
component in which the implications of identified variants are assessed. 
In most scenarios, genetic testing is performed to determine either 
the cause of a specific indication or the most appropriate treatment. 
However, exome and genome data can be reused to perform multiple 
assessments over time. This opens up the possibility of obtaining and 
storing genome and exome sequences before disease manifests, with the 
intention that they will be interpreted and reinterpreted as indications 
arise. Irrespective of when the sequence is obtained, the interpretation 
step is crucial. 

Figure 2 | Stages of the genetic interpretation process. Once genetic variants 
have been identified, they are filtered to select those of interest (step 1). Next, 
the evidence for each variant is assessed to determine the variant’s clinical 
impact (step 2). One or more assessed variants are then interpreted with 
respect to the specific condition for which the patient is being investigated 
(step 3). Last, the overall genetic assessment is placed into the patient’s clinical 
and personal context to inform the clinical-care decision-making process 
(step 4). 

Clinical interpretation is a multiple-component process (Fig. 2). After 
a pool of variants has been identified, step 1 determines which of those 
variants should be assessed further. In step 2, the clinical impact of each 
of those variants is assessed. In step 3, the relevance of the combination 
of variants identified in step 2 is considered in relation to the patient’s 
indication. Finally, in step 4, the test results are placed within the context 
of all known information about the patient to determine clinical care. 
Many laboratories base their reports on steps 1-3, although some simply 
report the variants they have identified. The patient’s clinician often 
reviews the laboratory analysis and completes any remaining steps. 
These processes can be time consuming and therefore expensive. They 
also involve a considerable amount of professional judgement, which 
makes them subject to human error and differences of opinion. The 
quality and efficiency of these processes are highly dependent on the 
data available in clinical and research databases. For this reason, these 
databases are in many ways the core of the ecosystem that is needed to 
advance the practice of precision medicine. In addition, more standard- 
ized ways to evaluate evidence, such as those released by the American 
College of Medical Genetics and Genomics, will be crucial for inter- 
preting human genetic variation in a more consistent way. Finally, the 
open sharing of clinical interpretations to distribute the labour of variant 
assessment, to identify and resolve differences in interpretation, and to 
catalogue variation for research studies is essential for improving the 
care of patients with genetic-based conditions. 
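
The four interpretation steps described above can be pictured as a small pipeline. The sketch below is purely illustrative and makes several assumptions that are not in the text: the `Variant` fields, the rarity threshold used as the step 1 filter, the impact labels and the placeholder gene and variant names are all hypothetical, and real laboratory workflows rely on far richer evidence and professional judgement.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple


@dataclass(frozen=True)
class Variant:
    gene: str
    name: str
    population_frequency: float  # assumed filter criterion, not from the text


def select_variants(variants: List[Variant], max_frequency: float = 0.01) -> List[Variant]:
    """Step 1: choose which variants to assess (here, rare ones; an assumed rule)."""
    return [v for v in variants if v.population_frequency <= max_frequency]


def assess_variants(
    variants: List[Variant], knowledge_base: Dict[str, str]
) -> List[Tuple[Variant, str]]:
    """Step 2: look up each variant's clinical impact in a shared knowledge base."""
    return [(v, knowledge_base.get(v.name, "uncertain")) for v in variants]


def interpret_for_indication(
    assessed: List[Tuple[Variant, str]], indication_genes: Set[str]
) -> List[Tuple[Variant, str]]:
    """Step 3: keep assessed variants in genes relevant to the patient's indication."""
    return [(v, impact) for v, impact in assessed if v.gene in indication_genes]


def place_in_context(findings: List[Tuple[Variant, str]], patient_notes: str) -> dict:
    """Step 4: hand the findings and the clinical context to the clinician."""
    return {
        "findings": [(v.name, impact) for v, impact in findings],
        "patient_notes": patient_notes,
        "needs_clinician_review": True,  # the final decision stays with a person
    }


# Hypothetical inputs: placeholder names only, no real variants.
variants = [
    Variant("GENE_A", "variant_1", 0.0001),
    Variant("GENE_A", "variant_2", 0.2),
    Variant("GENE_B", "variant_3", 0.001),
]
knowledge_base = {"variant_1": "pathogenic"}

selected = select_variants(variants)
assessed = assess_variants(selected, knowledge_base)
relevant = interpret_for_indication(assessed, {"GENE_A"})
report = place_in_context(relevant, "hypothetical clinical notes")
```

Step 2 is a plain dictionary lookup here, which makes the dependence on shared databases visible: a variant that is missing from the knowledge base can only be reported as uncertain.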

In each patient encounter, clinicians must address several questions 
that relate to precision medicine. First, they must assess whether genet- 
ics could be relevant — and if so, order the appropriate tests. Once the 
test results are received, the clinician must determine how to apply them. 
Then they must manage the results over time. Information-technology 
support is needed to manage the large amount of patient data and other 
information that are required to execute these processes optimally. 
EHRs and their associated systems are the main means of providing 
such support to clinicians. 


Electronic health records 

EHRs are well positioned to be the apex of genetic information-tech- 
nology support. They should serve as the clinician’s gateway to all of the 
patient’s information, including any genetic data. Information should 
be organized and displayed in a way that integrates with the clinician’s 
workflow and facilitates diagnostic and treatment decisions. EHR and 
related systems can also provide clinicians with electronic clinical-deci- 
sion support (CDS) that provides extra information about a genetic test 
or result through an e-resource or InfoButton that links to electronic 
resources such as websites or databases. They can also issue pre-test 
and post-test pharmacogenomic warnings that highlight potentially 
adverse interactions between drugs and specific genetic variants. Pre- 
test warnings are triggered when a clinician takes an action that should 
be informed by a genetic assessment but there is no record of the assess- 
ment being performed. Post-test alerts are triggered when an action 
is taken that may be contraindicated by a patient's genetic profile. An 
example is ordering a high dose of azathioprine for a patient with a 
thiopurine methyltransferase deficiency. CDS systems can also alert 
clinicians when important information emerges on a patient's previously 
reported variant. In the future, CDS systems might be able to guide 
clinicians through complex scenarios that take into account multiple 
types of patient data, including genetics. Evolving such CDS is essential 
for the formation of a learning health-care system. 
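
The pre-test and post-test alert logic described above can be sketched as follows. This is an illustrative sketch only, not a real EHR or CDS interface: the rule table, the function name and the result labels are hypothetical, and dose is ignored for brevity. Only the pairing of azathioprine with thiopurine methyltransferase (abbreviated TPMT here) is taken from the text.

```python
from typing import Dict, Optional

# Hypothetical rule table: drug -> the genetic assessment that should inform
# the order, plus the results that would make the order a concern.
RULES = {
    "azathioprine": {
        "test": "TPMT",
        "contraindicated_results": {"deficient"},  # illustrative label
    },
}


def check_order(drug: str, genetic_results: Dict[str, str]) -> Optional[str]:
    """Return an alert message for a drug order, or None if no rule fires."""
    rule = RULES.get(drug)
    if rule is None:
        return None  # no genetics-based rule for this drug

    result = genetic_results.get(rule["test"])
    if result is None:
        # Pre-test warning: the order should be informed by a genetic
        # assessment, but none is on record.
        return f"PRE-TEST: no {rule['test']} assessment on record for {drug}"
    if result in rule["contraindicated_results"]:
        # Post-test alert: the order may be contraindicated by the profile.
        return f"POST-TEST: {drug} may be contraindicated ({rule['test']}: {result})"
    return None


untested = check_order("azathioprine", {})
deficient = check_order("azathioprine", {"TPMT": "deficient"})
normal = check_order("azathioprine", {"TPMT": "normal"})
```

Both branches depend on the genetic result being available as structured data, which is why results that arrive as unstructured documents are hard to use for this kind of support.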

Genetic information and CDS do not necessarily have to be imple- 
mented directly into the EHR — it is possible to integrate EHRs with 
external systems. Such integration can be seamless so that clinicians 
need never know that they are working with multiple systems. Provid- 
ing genetic support through the EHR is complex and it is currently 
unclear how much genetic functionality EHR vendors will build into 
their systems. Some have indicated that they are unlikely to store full 
genomic sequencing in the EHR, instead choosing to link to external 
genomic data stores and focus their internal functionality on managing 
test results that have been interpreted at a higher level. Irrespective of 
whether patient genetic profiles are stored in the EHR, a constellation 
of systems will need to be tightly integrated with the EHR to provide 
optimal support to clinicians. 

Displays of genetic information and CDS are often impossible to pro- 
vide without robust access to the patient’s genetic results and reports. 
The EHR and related clinician-facing systems must obtain genetic 
results from laboratory systems. This requires interfaces between 
the EHR provider and the laboratory. Creating these interfaces often 
involves establishing electronic connections that span multiple organi- 
zations and integrate systems from competing vendors. Relatively few 
such interfaces exist, largely because of the expense associated with cre- 
ating them. Generally, results are transmitted from the laboratory to the 
provider by fax, which makes it difficult to keep the results organized 
in the EHR. CDS usually relies on access to structured electronic data 
that cannot be reliably extracted from a fax. Even if a genetic result is 
recorded in a structured format, this structure is often lost when the 
result is transferred to clinicians involved in the patient’s care who oper- 
ate out of different institutions. Any results from direct-to-consumer 
testing are also unlikely to be transferred in a structured format. 
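
To illustrate why CDS depends on structured data, the following minimal sketch assumes a hypothetical `GeneticResult` record and a hypothetical rule table keyed on gene, reported phenotype and ordered drug. None of these names comes from an actual EHR or laboratory interface, and the alert text is illustrative rather than clinical guidance. A faxed report would have to be re-keyed into fields of this kind before any such rule could fire.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class GeneticResult:
    """Hypothetical structured result, as a laboratory interface might deliver it."""
    patient_id: str
    gene: str
    phenotype: str  # interpreted result, e.g. "deficient"


# Illustrative rule table (an assumption, not a clinical guideline):
# (gene, phenotype, drug) -> alert text shown to the clinician.
RuleKey = Tuple[str, str, str]
RULES: Dict[RuleKey, str] = {
    ("TPMT", "deficient", "mercaptopurine"):
        "TPMT deficiency on file: review thiopurine dosing guidance.",
}


def cds_alert(result: GeneticResult, ordered_drug: str) -> Optional[str]:
    """Return an alert if a rule matches the structured result, else None."""
    key = (result.gene, result.phenotype, ordered_drug.strip().lower())
    return RULES.get(key)


result = GeneticResult(patient_id="demo-001", gene="TPMT", phenotype="deficient")
print(cds_alert(result, "Mercaptopurine"))  # a rule matches this gene-drug pair
print(cds_alert(result, "warfarin"))        # no rule for this pair, so None
```

The lookup works only because gene, phenotype and drug arrive as discrete fields; the same information embedded in free text or an image offers nothing to match against.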

Several groups are working to promote interconnectivity that would 
enable CDS systems that incorporate genetic information. The Insti- 
tute of Medicine Roundtable on Translating Genomic-Based Research 
for Health established the Displaying and Integrating Genetics Infor- 
mation Through the EHR Action Collaborative (DIGITizE AC). 
DIGITizE AC brings together clinicians, laboratories, vendors, stand- 
ards organizations, government agencies and patient representatives 
to increase support for genetics in the EHR. The group has defined 
a set of genetics-based CDS rules that it seeks to roll out widely. This 
will involve leveraging the frameworks of standards bodies, such as
Health Level Seven (HL7) International, and the Logical Observation 
Identifiers Names and Codes (LOINC) database, as well as ontology 
and rule creators such as the Clinical Pharmacogenetics Implementation
Consortium (CPIC). The National Human Genome Research
Institute (NHGRI) has also established several consortia with EHR 
working groups to investigate how genetics can be supported in the 
EHR. These include: the Electronic Medical Records and Genomics 
(eMERGE) Network, Clinical Sequencing Exploratory Research
(CSER) and ClinGen.

However, the problem that these organizations are trying to solve 
is very difficult. Standardized message formats and ontologies are the 
best way to reduce the cost of establishing the laboratory-provider and 
provider-provider interfaces needed to underlie precision medicine.
However, these standards are helpful only if they robustly account for 
the different real-world scenarios they are intended to support and are 
broadly implemented by the vendor community (Fig. 3). Developing 
such standards requires an enormous amount of input from groups that 
combine deep clinical, laboratory, vendor and information-technology 
expertise. The DIGITizE AC has found that even defining the specific 
requirements for its initial set of narrow-use cases entailed a consider- 
able amount of interdisciplinary effort. Much more work is needed to 
build truly robust, general-purpose standards. 


The clinical laboratory viewpoint 

Clinical laboratories sit at the core of the interpretative process. Ideally, 
they provide both the evidence for individual variants as well as a case- 
level report that places all potentially relevant variants in the context of 
the patient’s presentation. Laboratories that perform genome sequenc- 
ing often discover variants that they have never seen before, which must 
then be assessed. Similarly, variants that have been seen before might 
need to be reassessed as new knowledge emerges. Variant assessment 
is becoming an important factor in the cost of genetic tests. It must be 
performed by skilled individuals because errors could result in inap- 
propriate patient care. Yet we know that variants can be interpreted 
differently. As of 11 September 2015, 369 organizations had submitted 
a total of 158,668 variants to ClinVar, a National Center for Biotechnology
Information (NCBI) database that acts as a single centralized public
repository to which institutions can submit their interpreted variants
as well as retrieve data from others. At least 2,000 of these have been
interpreted differently by submitters.
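
As a rough sketch of how discordant interpretations could be flagged, the snippet below groups submissions by variant and reports any variant that carries more than one classification. The record layout, variant identifiers and laboratory names are hypothetical and do not reflect the actual ClinVar schema. The last lines simply restate the figures quoted above as a proportion.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical submissions: (variant identifier, submitter, classification).
Submission = Tuple[str, str, str]


def find_discordant(submissions: Iterable[Submission]) -> Dict[str, List[str]]:
    """Return variants that received more than one distinct classification."""
    by_variant = defaultdict(set)
    for variant, _submitter, classification in submissions:
        by_variant[variant].add(classification)
    return {v: sorted(c) for v, c in by_variant.items() if len(c) > 1}


demo = [
    ("variant_1", "lab_a", "likely benign"),
    ("variant_1", "lab_b", "uncertain significance"),
    ("variant_2", "lab_a", "pathogenic"),
    ("variant_2", "lab_c", "pathogenic"),
]
print(find_discordant(demo))

# Proportion implied by the figures in the text (a lower bound).
discordant_share = 2000 / 158668
print(f"{discordant_share:.1%}")
```

On the numbers given, at least roughly 1.3% of submitted variants had been interpreted differently by submitters.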

Laboratories and clinicians can be assisted in two ways: better access 
to variant assessments performed by other institutions using consistent 
approaches, and tools to improve and standardize the variant assess- 
ment process. 


Building clinical genomic knowledge 

Sharing variant- and gene-level assessments between laboratories and 
clinicians can increase the quality and efficiency of the variant assess- 
ment process. Multiple efforts are under way to increase the sharing of 
such knowledge. The ClinGen programme is building an authoritative
central resource that defines the clinical relevance of genomic
variants for use in precision medicine and research. The programme
aims to increase the rate of submission to ClinVar and to improve
the content of ClinVar and other genomic resources through expert
curation. ClinGen has worked together with ClinVar to create a ‘star’
system that defines the level of review for each variant that is submitted
to ClinVar. ClinGen working groups have been established in multiple
clinical domains to curate gene-disease relationships and to interpret 
variants through expert consensus. 

Centralized knowledge repositories can also be created by linking 
together the infrastructure that supports different laboratories. For 
example, laboratories that use the GeneInsight Lab application are
able to use the system to communicate and share knowledge in real
time. This functionality has been used to create a network called
VariantWire and also supports the Canadian Open Genetics Repository
(COGR) network of Canadian labs. Importantly, an organization can
both participate in a knowledge-sharing network and contribute their 
data to ClinVar. By adopting a standardized infrastructure that helps 
to structure data for submission to ClinVar, public sharing becomes 
cheaper, more efficient and more comprehensive with respect to sup- 
plying the supporting evidence. 


Case repositories and biobanks 

An important driver of improvements to variant assessment processes
is the collection and analysis of case data. Clinical and research labora- 
tories often develop case repositories. The power of these repositories is 
a function of the number of cases that they contain. Therefore sharing 
cases across institutions is beneficial. However, it is difficult to combine 
data that have been stored in information systems developed by dif- 
ferent groups. Trade-offs must be made when deciding what data to 
capture and how deeply to standardize and structure them. The amount 
of data in a case repository can be increased by allowing contributors 
to deposit heterogeneous data that are incomplete or inconsistently 
validated and may therefore be difficult to process downstream. If
repository developers insist on the submission of complete, validated 
and consistent data, many cases will have to be excluded. 
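
The trade-off can be made concrete with a small sketch. The field names (`variant`, `phenotype`, `validated`) and the completeness rule are assumptions chosen for illustration, not the admission criteria of any real repository.

```python
from typing import Dict, List

Case = Dict[str, object]


def is_complete(case: Case) -> bool:
    """A case counts as complete if it has a variant, a phenotype and is validated."""
    return (
        bool(case.get("variant"))
        and bool(case.get("phenotype"))
        and case.get("validated") is True
    )


def admit(cases: List[Case], strict: bool) -> List[Case]:
    """Strict policy keeps only complete cases; lenient policy keeps everything."""
    return [c for c in cases if is_complete(c)] if strict else list(cases)


submitted = [
    {"variant": "variant_1", "phenotype": "phenotype_a", "validated": True},
    {"variant": "variant_2", "phenotype": "", "validated": True},
    {"variant": "variant_3", "phenotype": "phenotype_b", "validated": False},
]
print(len(admit(submitted, strict=True)), "cases under the strict policy")
print(len(admit(submitted, strict=False)), "cases under the lenient policy")
```

In this toy example the strict policy retains one case of three, whereas the lenient policy retains all three but leaves two that need extra handling downstream.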

Several databases have been launched that share case-level data across 
broad disease areas. The NCBI's database of Genotypes and Phenotypes 
(dbGaP) places minimal restrictions on the types of case data that can
be submitted and therefore serves as a generalized repository. However, 
because phenotypic data are often limited, making informative use of 
the information is difficult. Similarly, the European Bioinformatics 
Institute (EBI) maintains the European Genome-phenome Archive 
for storing case-level genomic data. The International Cancer Genome 
Consortium (ICGC) and The Cancer Genome Atlas (TCGA) have
each set up large repositories of somatic cancer sequencing data. The
American Society of Clinical Oncology (ASCO) is looking to incorporate
the tracking of patient outcomes to enable a learning health-care
system in its CancerLinQ platform. Repositories have also been
developed through direct patient participation and span non-profit, 
academic and commercial activities. 

[Figure 3 labels: Clinicians; Laboratory workers; Computer scientists; Multiple implementations linked by interface]

Figure 3 | Creating and implementing robust standards for the description
and structuring of data in laboratory processing and patient-care systems.
Professionals with diverse expertise interact with vendors of
laboratory-information systems and EHR systems to iteratively design and
implement standards that effectively enable techniques to be used in the clinic.

Figure 4 | Example of a learning health-care system. Case data can be
shared between laboratories to support variant assessment. In this example,
the BRCA2 p.Glu1593Asp variant in case D is classified initially as being of
‘uncertain significance’. After accessing genetic and phenotypic patient data
from cases A, B and C, in which there are other genetic explanations for the
clinical phenotype, the necessary evidence becomes available to classify the
BRCA2 p.Glu1593Asp variant as ‘likely benign’.

Access to clinical specimens associated with patient data is often
necessary to fully inform discovery and continuous learning. The costs
associated with collecting samples prospectively for research studies 
are enormous. However, when samples are collected in bulk and then 
placed into biobanks, which allows their reuse across studies, the costs 
decrease considerably. One of the keys to success is ensuring that strong 
consent processes are in place. Biobanks are moving from reposito- 
ries of de-identified, unconsented specimens towards fully consented 
models that allow association with longitudinal health records. This is 
another area in which universal consent language would simplify the 
infrastructure development process. Direct engagement of patients in 
this type of sharing can help participants to balance the risks and ben- 
efits. Patients must fully understand the scope of their consent and be 
actively informed and engaged as participants in the advancement of 
knowledge. The return to patients of clinically relevant results from 
research studies should also be supported because this allows patients 
to benefit directly from their data sharing. 

In addition to supporting individual case-level repositories and 
biobanks, efforts are being made to connect divergent databases. 
Launched in 2015, the Matchmaker Exchange is a centralized network 
for sharing case-level data within an international set of case-level 
repositories focused on gene discovery”. Although each database has 
its own data schema, the development of a common application pro- 
gramming interface” means that users can query genomic and pheno- 
typic data across multiple systems. This has encouraged the member 
databases to move towards implementing a common set of fields to 
facilitate effective data exchange for gene discovery. 
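
The idea of a common interface over databases with different schemas can be sketched as follows. The two databases, their field names, the phenotype codes and the adapter functions are all hypothetical; this is not the Matchmaker Exchange API itself, only an illustration of the adapter pattern that such an interface implies.

```python
from typing import Callable, Dict, List, Set, Tuple

# Two hypothetical member databases, each with its own native schema.
DB_A = [{"gene_symbol": "GENE1", "phenotype_codes": ["PHENO:1", "PHENO:2"]}]
DB_B = [
    {"candidateGene": "GENE1", "features": "PHENO:2;PHENO:3"},
    {"candidateGene": "GENE2", "features": "PHENO:1"},
]


def adapt_a(record: dict) -> dict:
    """Map a DB_A record onto the shared fields."""
    return {"gene": record["gene_symbol"], "phenotypes": set(record["phenotype_codes"])}


def adapt_b(record: dict) -> dict:
    """Map a DB_B record onto the shared fields."""
    return {"gene": record["candidateGene"], "phenotypes": set(record["features"].split(";"))}


SOURCES: Dict[str, Tuple[List[dict], Callable[[dict], dict]]] = {
    "db_a": (DB_A, adapt_a),
    "db_b": (DB_B, adapt_b),
}


def find_matches(gene: str, phenotypes: Set[str]) -> List[dict]:
    """Query every source for cases sharing the gene and at least one phenotype."""
    matches = []
    for name, (records, adapt) in SOURCES.items():
        for record in records:
            case = adapt(record)
            shared = case["phenotypes"] & phenotypes
            if case["gene"] == gene and shared:
                matches.append({"source": name, "shared_phenotypes": sorted(shared)})
    return matches


print(find_matches("GENE1", {"PHENO:2"}))
```

Each database keeps its native schema; only the thin adapter has to agree on the shared fields, which is why a common interface tends to pull members towards a common set of fields over time.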

There is no doubt that other such efforts will emerge, particularly as 
the Genomics England and Precision Medicine Initiative programmes 
develop. Furthermore, the data-standardization efforts that help to 
establish interfaces between laboratories and providers could assist 
the development of these case repositories. Organizations that supply 
patient data to these efforts must develop mechanisms for collecting 
data more uniformly and for sharing them consistently. By enabling 
patients to contribute their data directly, the collection of phenotype 
data can be accelerated and broadened.


Knowledge resources and tools 

In addition to clinical-knowledge and case-sharing networks, many 
laboratories and clinicians use research-grade knowledge resources and 
tools. Many types of tools and resources are used daily in the clinical 
workflow, even if they are not intended for direct clinical usage. In silico
assessment tools use computational algorithms to assess the likely
effects of DNA variation. Genome browsers can display multiple
tracks of information, including species conservation data, the location 
of gene transcripts and regulatory elements, and population genetic 
variation. A number of allele-frequency databases, such as dbSNP 
Short Genetic Variations, the National Heart, Lung, and Blood Insti- 
tute (NHLBI) Grand Opportunity Exome Sequencing Project's Exome 
Variant Server, the Exome Aggregation Consortium (ExAC) Browser 
and the 1000 Genomes Project, provide data that are used by clinical
laboratories to define variation that is unlikely to cause Mendelian
disorders. The NCBI’s PubMed database provides access to published 
biomedical literature, and public and commercialized efforts exist to 
curate and present the data contained in such literature in a more useful 
format, such as Online Mendelian Inheritance in Man (OMIM) and The 
Human Gene Mutation Database. To define the role of these knowledge
resources, users must assess which resources are useful, how their
quality is controlled and how best they can be integrated into clinical
workflows. ClinGen maintains a list
(http://clinicalgenome.org/tools/web-resources/) of web-based tools
that members of its community
have found useful. This resource has also been designed to serve as an 
e-resource that can be accessed through EHRs. 
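
As an illustration of how allele-frequency data can be used to set aside variation that is unlikely to cause Mendelian disorders, the sketch below keeps only variants whose highest observed population frequency falls below a cut-off. The 1% threshold, the record layout and the population labels are assumptions made for the example; in practice the cut-off depends on the disorder and on laboratory policy.

```python
from typing import Dict, List


def filter_rare(variants: List[Dict], max_af: float = 0.01) -> List[Dict]:
    """Keep variants whose highest population allele frequency is below max_af.

    Variants with no frequency data are kept, because absence from a
    database does not show that a variant is common.
    """
    kept = []
    for variant in variants:
        frequencies = variant.get("population_af", {})
        if max(frequencies.values(), default=0.0) < max_af:
            kept.append(variant)
    return kept


candidates = [
    {"id": "variant_1", "population_af": {"pop_a": 0.12, "pop_b": 0.002}},
    {"id": "variant_2", "population_af": {"pop_a": 0.0004}},
    {"id": "variant_3", "population_af": {}},
]
print([v["id"] for v in filter_rare(candidates)])
```

Using the maximum across populations, rather than a pooled average, reflects the point that a variant common in any one population is unlikely to explain a rare disorder.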


The researcher viewpoint 

The advent of precision medicine and its supporting infrastructure 
has given researchers the ability to influence clinical care directly. The 
release of innovative research tools and the addition of new informa- 
tion in the form of knowledge bases can immediately influence patient 
care by changing how genomic variants are assessed. Although data 
obtained through clinical settings typically require processing in specific 
ways, those obtained from research tend to be more flexible. This means 
that researchers can often tolerate more variability and inconsistency in 
their data sources than clinicians. For this reason, new infrastructure 
is usually released to the research community before it is optimized for 
clinical use. 

Clinical knowledge-sharing infrastructure and case repositories,
especially when combined with EHR-derived content, can not only provide
clinicians and clinical laboratories with unprecedented access to
clinical data, but also make this information accessible to researchers.
New models are emerging for the broad sharing of data for discov- 
ery purposes. For example, the crowd-sourced approach to solving 
complex biological problems taken by the DREAM Challenges is now 
being applied to clinical-trial data in the hope of advancing precision 
medicine. Such challenges pair the brightest computer scientists with
unprecedented open access to data to allow the development of highly 
informative models for predicting patient outcomes. To support this 
era of open data and discovery, it is crucial that appropriate consent 
approaches are established to allow clinical data to be used in these 
ways. Opportunities for discovery are also being created as the cost of 
sequencing falls and the processing capabilities of ‘big data’ become
increasingly accessible. All these factors could contribute further to the
formation of a continuously learning health-care system that simultaneously
engages clinical-care providers and researchers and is necessary
to support the development of precision medicine. 


Realizing continuously learning health care 

Ideally, continuous learning in health care would involve the capture 
of all incremental data, knowledge and experience gained through 
each patient interaction. This information would then be used in real 
time to improve the care of current and future patients. The ability to 
stratify patients, understand scenarios and optimize decision-making 
would consistently improve based on the myriad data obtained dur- 
ing the care-delivery process. This would be the ultimate expression 
of precision medicine. The infrastructure we discuss in this Review 
represents initial steps in this direction. We have already seen evidence 
to show that continuous-learning processes are achievable. Figure 4 
depicts an example of a continuous-learning system in which hypothetical
historical patient data from breast-cancer testing are accessed
to determine the pathogenicity of a new BRCA1 variant as ‘likely benign’.
The variant would otherwise be considered of ‘uncertain significance’.
Clinical laboratories often classify variants on the basis of historical 
clinical case histories. Because each laboratory has access to only a frac- 
tion of patients tested, optimal learning can only happen when data are 
shared broadly between organizations. ClinGen has made advances in 
clinical-laboratory genetic-data sharing through the use of the ClinVar
database. However, this level of sharing is more likely to lead to
therapeutic development and improved outcomes if the results of genetic
testing are accompanied by greater amounts of patient health data. This 
will require the emerging genetic infrastructure to be extended, such 
that it can integrate with as many other forms of patient data as possible. 


Addressing barriers to precision medicine 

Multiple issues must be overcome for personalized medicine to reach 
its potential, as summarized by Joyner and Paneth in seven key questions.
Although some doubt has been expressed that personalized
medicine will reach its full potential for common diseases, the recent 
shift in emphasis to studies of the genetic basis of rare diseases and 
somatic cancer could provide tangible success in this field. For exam- 
ple, mechanistic understanding of rare disease and cancer pathways 
might inform the understanding of common diseases and approaches to 
reducing risk more effectively than has been achieved through genome- 
wide association studies. However, to ensure that we can learn from our 
evolving experience in the diagnosis and treatment of all types of dis- 
ease, continuously learning health-care systems and broad data-sharing 
approaches must be supported. The absence of such systems is likely 
to be responsible for the limited success of personalized medicine to 
date. The continuous-learning infrastructure could be used to add a
testing methodology for new hypotheses, in which real-time evalua- 
tion is repeatedly conducted against a limited set of treatment decisions 
for a given condition to determine which treatments provide the best 
results for different patient subgroups. Improved decision-making — 
at present, based on access to more up-to-date knowledge, and in the 
future, based on real-time evaluation techniques — has the potential 
to partially offset cost concerns by reducing expenditures associated 
with unnecessary or ineffective care. These improvements are also likely 
to generate public-health benefits. Improved infrastructure to capture 
both test results and patient outcomes should enable the measurement 
of such benefits. 

The type and quality of patient data stored in EHRs are clearly issues 
that need to be addressed to support a continuously learning health-care 
system. In our experience, the investment required to capture higher- 
quality and more clinically relevant data is made only when a near-term 
financial return on those investments can be established. An improved 
foundational infrastructure provides an expanded basis for innovation 
and thereby facilitates the development of tools and analysis that are 
capable of justifying these foundational investments. 

Open data-sharing resources, as well as the principle of open data 
itself, can help to reduce the cost of conducting genetic research. They 
can also limit the number of conflict-of-interest problems that occur 
as academic medical centres increasingly partner with commercial 
activities. Although this infrastructure does not help to solve general- 
ized funding issues, it does set a precedent for sharing data rather than 
keeping it proprietary. In doing so, it reduces the scope and impact of 
potential conflicts and helps to ensure that commercial relationships 
are based on open principles. 


Future directions 

Despite compelling examples of the use of genomics to support preci- 
sion medicine, the core building blocks that will be necessary to scale 
up the field are still in a very primitive state. However, as the community 
works to improve these building blocks and link them up, transforma- 
tions are beginning to occur. Clinicians, researchers, laboratories and 
vendors are working together to build the tools that will close the distance
between stakeholders. It is becoming easier to move, compare,
apply and reproduce knowledge, data and samples. The basic infrastruc- 
ture required to support a continuously learning health-care system has 
started to evolve spontaneously in many different areas. Furthermore, 
a cultural change is emerging as researchers, clinicians and patients 
embrace the open sharing of data to facilitate scientific advancement. 
Although it is unclear how long it will take to build an infrastructure 
that fully supports the widespread sharing and effective use of genomic 
and health data, the ultimate result will be a transformation of health 
care that allows continuous advances in medicine to occur within a 
clinical-care system that is less dependent on externally funded research 
endeavours.
Pharmacogenomics in the clinic


Mary V. Relling & William E. Evans


After decades of discovery, inherited variations have been identified in approximately 20 genes that affect about 80 
medications and are actionable in the clinic. And some somatically acquired genetic variants direct the choice of ‘tar- 
geted’ anticancer drugs for individual patients. Current efforts that focus on the processes required to appropriately 
act on pharmacogenomic variability in the clinic are moving away from discovery and towards implementation of an 
evidence-based strategy for improving the use of medications, thereby providing a cornerstone for precision medicine.


Pharmacogenomics focuses on the identification of genetic variants
that influence drug effects, typically through alterations in
pharmacokinetics (that is, how the drug is absorbed, distributed,
metabolized or eliminated) or pharmacodynamics, by modifying its
target or by perturbing the biological pathways that shape a patient's
sensitivity to its pharmacological effects. In conditions apart from cancer
and infectious diseases, the genetic variations of interest are primarily in
germline DNA, and are inherited from a parent or are de novo changes
that alter the function of gene products. In cancer, both inherited and
somatically acquired variants can influence a patient’s response to
treatments. In infectious diseases, genetic variation can affect a
pathogen’s sensitivity to antimicrobial drugs. Advances in genome
interrogation technology and in analytical approaches have facilitated the
evolution of a discovery model from candidate gene studies towards
agnostic genome-wide analyses of patient populations with specific
drug-response phenotypes (for example, toxicity or desired
pharmacological effects). In fact, current technologies for genome interrogation
are sufficiently robust that defining the drug-response phenotype has
become the more difficult component of pharmacogenomics research.
Once a pharmacogenomic relationship has been discovered and validated,
there are many obstacles to translating it into clinical practice.
Such translation requires that effective, alternative therapy is available
for those with ‘high-risk’ genotypes, as well as improvements to
health-care systems, structured approaches to guide prescribing (for example,
algorithms), and implementation of point-of-care electronic clinical
decision support (CDS), to make it feasible to utilize genetics
appropriately to guide drug prescribing.

A decade ago, we laid out a vision of how evolving genome technologies
could be deployed to facilitate pharmacogenomic discoveries, and
here in this Review, we extend this vision to address how these relationships
can be translated into tools to optimize the use of medications in
the clinic.


Evolution of pharmacogenomics 

The earliest origins of pharmacogenomics are unclear; in 510 BC the
Greek philosopher and mathematician Pythagoras reported that a sub- 
set of people who ingested broad beans (Vicia faba) experienced poten- 
tially fatal haemolytic anaemia (Fig. 1). Over two thousand years later, 
this reaction was attributed to an inherited deficiency in the enzyme 
glucose-6-phosphate dehydrogenase (G6PD), which also predisposes 
patients to haemolysis from several medications, including rasburicase 
and the antimalarial primaquine. In 1909, while studying the common
bean Phaseolus vulgaris, the Danish pharmacist Wilhelm Johannsen
coined the terms genotype and phenotype, linking genotype to the
effects of volatile organics, a presage to pharmacogenetics. A clustering
of drug-metabolizing enzyme activities by racial group strongly
suggested a genetic component to population variation.

In 1959, the German geneticist Friedrich Vogel was the first to use
the term ‘pharmacogenetics’, a concept that was bolstered by landmark
studies by the pharmacologists Elliott Vesell and John G. Page showing
that the pharmacokinetic profile of the pain-relieving drug antipyrine is
more similar in monozygotic twins than in dizygotic twins. The clinical
relevance of the field was reinforced when family studies in different
racial groups indicated that differences in the metabolism of isoniazid
(a treatment for tuberculosis) and its side effect, peripheral neuritis,
were inherited as an autosomal recessive trait. Decades later, differences
in the metabolism of isoniazid were shown to be caused by inherited
variants in the NAT2 gene, which encodes the N-acetyltransferase 2
enzyme.

Other family studies conducted between the 1960s and 1980s documented
patterns of inheritance for many drug effects, which eventually
led to molecular studies that revealed the inherited determinants
for many of the traits. In 1987, CYP2D6 became the first polymorphic
human drug-metabolizing gene to be cloned and characterized. In the
1990s, the potential clinical utility of pharmacogenomics was clearly
illustrated for several genes, including TPMT, which encodes the
enzyme thiopurine methyltransferase. People with an inherited deficiency
in this enzyme were found to experience haematopoietic toxicity
on administration of the antileukaemic and immunosuppressive
thiopurine drugs mercaptopurine and azathioprine, although implementation
of this finding in the clinic progressed slowly at that time.

As in most areas of genetics, the rate of pharmacogenetic discovery 
has been accelerated by the Human Genome Project and by improved 
technologies for the genome-wide interrogation of variation. This short- 
ened the timeline for discovery and enabled agnostic genome-wide 
studies of populations of patients with specific drug-effect phenotypes, 
often leading to the identification of unanticipated genetic variants 
that were statistically associated with drug effects. These genome-wide 
strategies also helped to bring the term ‘pharmacogenomics’ into the 
pharmacology lexicon.

Discoveries that emerge from genome-wide or candidate-gene strate- 
gies require independent validation before they can be translated into 
clinical diagnostics. The validation process can be facilitated by eluci- 
dating the mechanisms that determine how the variation alters drug 
Figure 1 | Inherited G6PD deficiency and haemolysis. People with a
deficiency in the enzyme glucose-6-phosphate dehydrogenase (G6PD) can develop
haemolytic anaemia after eating broad beans (left) or when taking the drug
rasburicase (right). In the case of broad beans, the reaction is the result of
the chemical moieties isouramil (R = OH) and divicine (R = NH2), and for


responses. Genetic variants often differ according to ancestry, which can 
confound the translation of pharmacogenetic traits from one population 
to another. For instance, genetic polymorphisms in the genes CYP2C9 
and VKORC1 have a population-specific influence on the anticoagulant
effects of warfarin. Furthermore, it is becoming increasingly evident
that many drug effects are influenced by multiple variants in the same 
gene — some of which are rare — and by variants in multiple genes within 
the same patient. The Genomics England 100,000 Genomes Project and 
the US National Institutes of Health (NIH) Pharmacogenomics Research 
Network are two of many ongoing efforts to facilitate genome discoveries 
and their translation into diagnostics. Their findings may eventually be 
used to optimize the selection and dosing of medications for individual 
patients. The discovery and translation of inherited determinants of drug 
response and somatically acquired genetic variants in cancer are promi- 
nent pharmacogenomic components of these and other initiatives. 


Diagnostic testing 

Before being used in a clinical setting, genetic tests must meet certain 
criteria concerning their analytical validity, clinical validity and clinical 
utility. Developing genetic tests with analytical validity is nontrivial
for pharmacogenes such as CYP2D6 (ref. 21) because the gene is subject 
to germline copy number variation and the formation of hybrid gene 
fusions that are difficult to assay and interpret reliably. Clinical utility 
involves assessing whether the use of the test leads to improved health 
outcomes for patients who are subject to testing, as well as an assess- 
ment of the risks that occur as a result of testing. There is substantial 
difference of opinion as to precisely which outcomes constitute clinical 
utility. Some studies have broadened their scope to assess the impact
of testing on the entire health-care system, including comparing the 
costs of genetic testing with those of other health-care interventions, as 
well as understanding how such testing can influence the behaviour of 
clinicians. For example, Hong Kong introduced a policy to screen for 
the HLA-B*1502 allele before prescribing antiepileptic drugs because 
patients with the allele are at high risk of developing severe skin reactions
to the drug carbamazepine. But the policy led clinicians to forgo
prescribing carbamazepine at all and instead they began to prescribe
phenytoin. Phenytoin can also cause severe skin reactions, but the risk
factors are not as well defined as for carbamazepine, so the overall
incidence of severe skin reactions remained unchanged. For the purposes
of this Review, we focus on the clinical utility of pharmacogenomic 
testing for individual patients without considering the possible public- 
health consequences of changes in prescribing behaviour. 

As the cost of sequencing continues to fall, many have predicted that 
every individual will in the not-too-distant future have their germline 
genome sequenced early in life and the results will be available for clini- 
cal use throughout their lifetime. Assuming that this prediction is real- 
ized (to some extent, at least), we call for a shift away from the debate 
rasburicase it is the hydrogen-peroxide product of the breakdown of uric acid.
Red blood cells of G6PD-deficient individuals (primarily those with the class
II ‘Mediterranean’ allele of G6PD) produce insufficient NADPH to protect
from oxidative damage caused by these moieties, which leads to haemolysis.
Consequently, rasburicase is contraindicated in G6PD-deficient individuals.


over whether patients should be tested for specific pharmacogenes 
before they are prescribed specific drugs, and instead, we suggest mov- 
ing towards a model in which clinicians are provided with guidelines on 
how to interpret and deploy genetic variants to improve their prescribing.
This assumption underlies the efforts of the Clinical Pharmacogenetics
Implementation Consortium (CPIC), an open, international
non-profit group that creates standardized guidelines on how to use 
genomic data to inform prescribing. The guidelines are evidence-based, 
peer-reviewed and publicly available. 

A number of factors are used to determine whether there is enough 
evidence to support the analytical validity, clinical validity and clinical 
utility of a pharmacogenomic test and to warrant its use in guiding the 
prescription of medications”. The analytical validity will depend on the 
quality of the data from genetic tests as well as the test’s performance 
characteristics, such as the positive and negative predictive values. Many 
types of data can be used to evaluate clinical validity and utility, includ- 
ing the penetrance of genetic variation on drug effects, which can be 
determined from retrospective studies. The mechanism or mechanisms 
by which genetic variation influences drug effects or a relevant endophe- 
notype (an intermediate phenotype, such as drug-metabolizing enzyme 
activity) can also be used. Additionally, data can be gathered from in vivo 
pharmacokinetic or other functional studies, in vitro functional studies, 
and preclinical and clinical studies that link pharmacological effects or 
drug concentrations to genetic variation. Further sources of data include 
case reports, family studies, and randomized clinical trials that compare 
the outcomes of genetics-based prescribing versus prescribing that is not 
based on genetic-test results. Other factors that are considered when 
deciding on the actionability of pharmacogenomic variation include the 
therapeutic index of a drug (the ratio of the toxic dose to the therapeutic 
dose), the severity of the drug's toxicity, the severity of the underlying 
disease and the consequences of suboptimal prescribing. 

Table 1 | Actionable germline genetic variation and associated medications 

Genetic variation | Medications 
TPMT | Mercaptopurine, thioguanine, azathioprine 
CYP2D6 | Codeine, tramadol, tricyclic antidepressants 
CYP2C19 | Tricyclic antidepressants, clopidogrel, voriconazole 
VKORC1 | Warfarin 
CYP2C9 | Warfarin, phenytoin 
HLA-B | Allopurinol, carbamazepine, abacavir, phenytoin 
CFTR | Ivacaftor 
DPYD | Fluorouracil, capecitabine, tegafur 
G6PD | Rasburicase 
UGT1A1 | Irinotecan, atazanavir 
SLCO1B1 | Simvastatin 
IFNL3 (IL28B) | Interferon 
CYP3A5 | Tacrolimus 

From ref. 44 (accessed on 7 May 2015). See ref. 44 for updates. 
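
The positive and negative predictive values mentioned above as performance characteristics follow from a test's sensitivity and specificity and from how common the predicted outcome is. The sketch below applies Bayes' rule to show that dependence; the function name and the input numbers are illustrative assumptions and are not taken from any study cited in this Review.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a binary test using Bayes' rule.

    All three inputs are proportions between 0 and 1.
    """
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    false_neg = (1.0 - sensitivity) * prevalence
    true_neg = specificity * (1.0 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Hypothetical inputs chosen only to illustrate the calculation.
ppv, npv = predictive_values(sensitivity=0.99, specificity=0.95, prevalence=0.05)
print(f"PPV = {ppv:.3f}, NPV = {npv:.4f}")
```

With a rare outcome, even a highly sensitive and specific test can have a modest positive predictive value while its negative predictive value stays high, which is one reason that the severity of the adverse event matters when deciding on actionability.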

An important consideration for the actionability of a gene-drug rela- 
tionship is the availability of an alternative therapy, which may partly 
depend on the mechanism of the gene-drug association. If the gene 
affects the pharmacokinetics of a drug (such as in the CYP3A5-mediated 
catabolism of the immunosuppressant drug tacrolimus), there could be 
substantial published evidence to support a dose adjustment in much the 
same way as doses are often adjusted according to age or kidney or liver 
function. Such dose-adjustment decisions are particularly defensible if 
the drug is one for which therapeutic drug monitoring (based on the 
concentration in the blood) is readily available. When tests indicate that a 
drug is either unlikely to be effective or could have unacceptable adverse 
effects in patients with a particular genotype, the recommendation for an 
alternative therapy will depend on the balance of evidence for both the 
efficacy and possible toxicity of the alternative. For example, individuals 
who are homozygous for inactive CYP2D6 alleles are unable to convert 
the analgesic drug codeine into its active metabolite, morphine, but can 
respond to several other opiate analgesics”. If genetic tests indicate an 
extremely high risk for a serious adverse event — for example, carriers of 
the HLA-B*57:01 allele have a high risk of hypersensitivity to the antiret- 
roviral drug abacavir**” — the alternative therapy should be equally 
effective but have an acceptable risk of adverse effects (that may or may 
not be influenced by other genetic variants). 

Some treatment decisions are less clear cut. For example, there are 
substantial data to show that patients with two defective CYP2D6 alleles 
are more likely to experience recurrence of breast cancer after treatment 
with tamoxifen. This is because these patients produce much lower lev- 
els of the active tamoxifen metabolite, endoxifen, than those without 
defective alleles** *°. However, it is unclear whether the best alternative 
is another drug (a different selective oestrogen-receptor modulator, for 
instance) or an altered dose of tamoxifen, particularly in premenopausal 
women for whom there is a shortage of data to support alternative treatments. 
Such cases are the most difficult to resolve: although it is clear from 
pharmacogenomic testing that the drug or drug dose is suboptimal in a 
patient with the high-risk genotype, a lack of clinical data for alternative 
therapies makes it difficult to recommend other medications. 

The CPIC considers all such evidence when deciding which gene-drug 
pairs are clinically actionable. Given the high bar for clinical actionability, 
the number of actionable genes — those with at least one actionable high- 
risk diplotype — is small, and the list of medications for which clinical 
actions are recommended (pharmacogenetically high-risk drugs) is rela- 
tively short (Table 1). There are also additional medications that include 
pharmacogenomic information in their labels****. However, not all are 
actionable. Sometimes, information on genetic variation is included when 
the effects are modest and therefore do not require changes to be made 
to the prescribing section of the drug label. This information has also 
been included for some drug labels despite weak or conflicting evidence. 

At present, there are only two examples of actionable pharmacogenes 
that also carry a disease risk: the gene UGT1A1 and Gilbert's syndrome”, 
and the gene G6PD and haemolytic anaemia”. Thus, many of the ethics 
concerns that affect the clinical implementation of disease-risk genom- 
ics have less relevance for pharmacogenomics”. 


Clinical implementation of pharmacogenomics 

Genetic variants that influence the clinical effects of some medications 
can now be reliably assayed in the clinical setting. Prescribing decisions 
for such clinically actionable gene-drug pairs should be influenced by 
these genetic-test results. 
Figure 2 | Medications affected by actionable pharmacogenes. 
Approximately 7% of FDA-approved medications are affected by actionable 
inherited pharmacogenes (left), and approximately 18% of US outpatient 
prescriptions are affected by actionable germline pharmacogenomics (right)”, 
which demonstrates that several pharmacogenetically high-risk drugs are 
commonly prescribed. 


Drugs and genes 

More than 1,200 individual molecular entities have been approved as 
drugs by the US Food and Drug Administration (FDA)”, the European 
Medicines Agency (EMA)” or by Japan’s Pharmaceuticals and Medical 
Devices Agency (PMDA)”*. Although about 15% of the medications 
approved by the FDA and EMA contain pharmacogenomic information 
on their label**”’, only a subset of the corresponding pharmacogenes is 
deemed actionable. As summarized in Fig. 2, 7% of medications have 
actionable germline pharmacogenetics. These correspond to CPIC 
level A or B gene-drug pairs“ for which genetic information should or 
could be used to change the prescription pattern of the relevant drug. In 
the United States, these medications constitute 18% of all prescriptions, 
which indicates that pharmacogenomically high-risk medications are 
slightly overrepresented in highly prescribed medications (Fig. 2)”. So 
far, only 16 of the roughly 19,000 human genes are considered to be 
clinically actionable for germline pharmacogenomics™. Most human 
germline genetic variation is unlikely to be actionable for the prescrip- 
tion of medication, and pharmacogenomics is unlikely to be useful for 
improving the prescription of the majority of drugs. However, for the 
relatively small set of medications for which genomics is actionable, 
prescribing could be optimized if genetic testing were more widely and 
appropriately deployed in the clinic. In the meantime, the number of 
such actionable gene-drug pairs continues to grow, albeit at a slow pace. 
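
The proportions quoted in this section can be related to one another with simple arithmetic. The following sketch uses only the figures given above (7% of medications, 18% of US prescriptions, and 16 of roughly 19,000 genes); the function name is a hypothetical label chosen for illustration.

```python
def fold_enrichment(share_of_prescriptions, share_of_medications):
    """Ratio of a drug group's share of prescriptions to its share of approved drugs."""
    return share_of_prescriptions / share_of_medications


# Figures as quoted in the text above.
ratio = fold_enrichment(share_of_prescriptions=0.18, share_of_medications=0.07)
actionable_gene_fraction = 16 / 19_000

print(f"Prescription share relative to drug share: {ratio:.1f}-fold")
print(f"Fraction of human genes considered actionable: {actionable_gene_fraction:.2%}")
```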


Somatically acquired genomic variation 

Genetic variants that are specific to cancer tissue represent a special 
case of pharmacogenomics. Somatic variation can identify which 
types of malignancy are likely to respond to various anticancer 
agents***’. The recognition that cancer tissue can be distinguished 
from normal tissue by the presence of specific genomic abnormali- 
ties pre-dates the Human Genome Project. For example, in the 1980s 
and 1990s ploidy in neuroblastoma“ and cytogenetic abnormalities in 
acute lymphoblastic leukaemia” were used to determine the composi- 
tion and strength of the cytotoxic chemotherapy required to treat these 
cancers. The genetic testing of malignancies has since become more 
specific in response to the development of anticancer agents that are 
directed at — or more effective at — treating tumours that harbour 
certain acquired genetic variants (Table 2). Although the FDA generally 
requires that diagnostics be developed alongside targeted anticancer 
agents, the EMA is less stringent”. However, proposed changes to 
the European Union’s framework would lead to greater harmonization 
between the United States and Europe”. 
Table 2 | Actionable somatic genetic variants in cancer cells and associated medications 

Genetic abnormality | HGVS nomenclature* | Target† | Medications | Disease 
AKT mut (act) | p.Glu17Lys | mTOR | Sirolimus, everolimus | RCC 
BCR-ABL (SV) | t(9;22) (q34.1;q11.21) | ABL | Imatinib, dasatinib | CML, Ph+ ALL 
BCR-ABL (SV, mut) | p.Val299Leu | ABL | Bosutinib, nilotinib | Imatinib-resistant CML 
BCR-ABL (T135I) | p.Thr135Ile | ABL | Ponatinib | CML, Ph+ ALL 
BCR-ABL (SV) | t(9;22) (q34.1;q11.21) | SRC | Dasatinib | CML, Ph+ ALL 
BRCA1/2 variants | Too numerous to list | PARP | Olaparib | Ovarian cancer 
BRAF SNVs (V600E/K) | p.Val600Glu, p.Val600Lys, p.Val600Asp | BRAF | Dabrafenib, vemurafenib | Melanoma 
BRAF SNVs (V600) | p.Val600Glu, p.Val600Lys, p.Val600Asp | MEK | Trametinib | Melanoma 
EGFR (ex 19 del, SNV L858R) | p.Glu746_Ala750del, p.Leu858Arg | EGFR | Afatinib, erlotinib | NSCLC (EGFR+) 
EGFR mut (act, amp) | p.Glu746_Ala750del, p.Leu858Arg | EGFR | Gefitinib | NSCLC (EGFR+) 
EGFR+ and WT KRAS | N/A | EGFR | Cetuximab, panitumumab | EGFR+ colon cancer (WT KRAS) 
EML-ALK (SV) | inv(2)(p21p23) | ALK | Crizotinib | NSCLC 
FLT3 CNV (amp) | p.D600_L601insFREYEYD, p.Asp835Tyr | FLT3 | Sunitinib, sorafenib | AML 
HER2 (amp) | N/A | ERBB2 | Lapatinib, trastuzumab | HER2+ breast cancer 
KIT mut (act) | p.Trp557_Lys558del, p.Asp579del, p.Val559Asp | KIT | Imatinib, sunitinib | RCC, GIST 
PDGFR (mut or SV) | p.Asp842Val | PDGFR | Sunitinib, imatinib | RCC, GIST, pancreatic cancer 
PI3K (mut or amp) | PIK3CA p.Glu542Lys, p.Glu545Lys; p.His1047Arg, p.His1047Leu | PI3K | Idelalisib | CLL, NHL 
RARA (SV, gene fusion) | t(15;17)(q24;q21) | RARA | Tretinoin, alitretinoin | APL, CTCL, Kaposi sarcoma 
RARA (SV, gene fusion) | t(15;17)(q24;q21) | RARA | Arsenic trioxide | APL 
SMO mut (act) | p.Trp535Leu, p.Arg199Trp, p.Arg562Gln | Smoothened | Vismodegib | Basal cell carcinoma 
VHL (mut) | Too numerous to list | VEGFR | Sorafenib | RCC, hepatic cancer, thyroid cancer 
VEGF (mut) | N/A | VEGF | Ziv-aflibercept | Colon cancer 

Medications targeting normal cell surface proteins that are expressed on some tumour cells (for example, ER, PR, CD20, CD30, CD52) are not included in this summary of drugs that target proteins with 
aberrant expression or function due to somatic genetic variants. Act, activating; ALCL, anaplastic large cell lymphoma; ALL, acute lymphoblastic leukaemia; AML, acute myeloid leukaemia; amp, amplification 
(typically by CNV); APL, acute promyelocytic leukaemia; CLL, chronic lymphocytic leukaemia; CML, chronic myeloid leukaemia; CNV, copy number variant; CTCL, cutaneous T-cell lymphoma; del, deletion; 
ER, oestrogen receptor; ex, exon; GIST, gastrointestinal stromal tumour; HGVS, Human Genome Variation Society; ins, insertion; mut, mutation; N/A, not applicable; NHL, non-Hodgkin lymphoma; NSCLC, 
non-small-cell lung cancer; Ph, Philadelphia chromosome; PR, progesterone receptor; RCC, renal cell carcinoma; SNV, single nucleotide variant; SV, structural variant; WT, wild type. 

*Only representative examples of known mutations are shown. 

†In general, targets are protein products encoded by the gene listed. 


Reactive and pre-emptive testing 

Initially, clinical testing was deployed through single-gene pharma- 
cogenetic tests — a practice that evolved from the coupling of strong 
monogenic gene-drug associations with limitations in genotyping 
technology”. In this model, genetic tests are ordered one at a time on a 
reactive basis: if the patient might need a pharmacogenetically high-risk 
drug, the clinician orders the applicable test. However, improvements in 
technology mean that it is now possible to interrogate multiple genes in 
a single assay at lower cost than for multiple single-gene tests. 

Most human diseases, including cancer, are influenced by multiple 
genes and genetic variants. Likewise, the pharmacokinetics and phar- 
macological effects of most medications are determined by multiple 
gene products, such as drug-metabolizing enzymes, drug-transporting 
membrane proteins, drug targets and disease-modifying genes. Many 
of the actionable pharmacogenes that have been identified to date exert 
a strong effect on the pharmacokinetics or pharmacodynamics of their 
associated drug, which makes them easy to identify. The strong effects 
of the polymorphism in the gene TPMT on the risk of haematopoietic 
toxicity from thiopurine medications are an excellent illustration of how 
‘low-hanging fruit’ are often merely the first step down a polygenic path. 
Subsequent studies showed that when the dosage of mercaptopurine is 
adjusted in response to TPMT testing, polymorphisms in other genes, 
such as ITPA, begin to surface as important”. Furthermore, polymorphisms 
in other genes that form part of the same pharmacological 
pathway can emerge as being significant for populations of a different 
ancestry. An example of this is the strong influence of an inherited variant 
found in the gene NUDT15 on thiopurine toxicity. NUDT15 variants 
are extremely uncommon in individuals of European and African 
ancestry, but are relatively common in people of Asian descent™. This 
explains the high frequency of thiopurine intolerance in Asian popula- 
tions despite the low frequency of TPMT variants. When a genome-wide 
association study of thiopurine intolerance was performed in a popu- 
lation that comprised people of European, Asian, African and Native 
American ancestry, germline variants in both TPMT and NUDT15 
reached genome-wide significance for their association with thiopurine 
intolerance*’. TPMT variants were revealed to be the major determinant 
of the tolerated dose of thiopurine medications in patients of European 
and African ancestry, whereas NUDT15 was the major genetic deter- 
minant in patients of Asian and Native American ancestry. Moreover, 
the metabolism and effects of anticancer agents, including thiopurines, 
can be affected by both germline and somatic genetic variation”, which 
further increases the complexity of cancer pharmacogenomics. 

Several other examples exist in which more than one gene is clini- 
cally actionable for a given medication. These include the anticoagulant 
warfarin, which is affected by both CYP2C9 and VKORC1 (ref. 19), 
and tricyclic antidepressants, which are affected by both CYP2C19 and 
CYP2D6. Given that a single gene can affect more than one medication 
(Table 1), there are potential benefits of genotyping a panel of phar- 
macogenomic variants that apply to a number of drugs that a patient 
might receive in their lifetime. There is increasing evidence to show that 
genotyping multiple genes in a single assay is more cost-effective, uses 
the DNA in the sample more efficiently, and facilitates the pre-emptive 
availability of genetic-test information. Such multigene panels can 
change practice from a reactive approach (in which a fresh genetic test 
is ordered every time it is required) to a pre-emptive approach (in which 
a single sample is assessed for many likely-to-be actionable genes at the 
same time), thereby providing the patient with a lifetime's worth of test 
results. Several groups have already begun to implement pre-emptive 
multigene panels for pharmacogenomics”’“’, but the practice is by no 
means widespread at present. 
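
The many-to-many relationship between genes and medications is what makes a pre-emptive panel attractive: one stored set of results can be consulted for whichever drug is later considered. A minimal sketch of such a lookup follows, using the gene-drug pairs listed in Table 1; the data structure and function names are illustrative assumptions and do not describe any particular clinical system.

```python
# Gene -> medications, transcribed from Table 1 of this Review.
GENE_TO_DRUGS = {
    "TPMT": ["mercaptopurine", "thioguanine", "azathioprine"],
    "CYP2D6": ["codeine", "tramadol", "tricyclic antidepressants"],
    "CYP2C19": ["tricyclic antidepressants", "clopidogrel", "voriconazole"],
    "VKORC1": ["warfarin"],
    "CYP2C9": ["warfarin", "phenytoin"],
    "HLA-B": ["allopurinol", "carbamazepine", "abacavir", "phenytoin"],
    "CFTR": ["ivacaftor"],
    "DPYD": ["fluorouracil", "capecitabine", "tegafur"],
    "G6PD": ["rasburicase"],
    "UGT1A1": ["irinotecan", "atazanavir"],
    "SLCO1B1": ["simvastatin"],
    "IFNL3": ["interferon"],
    "CYP3A5": ["tacrolimus"],
}


def invert(gene_to_drugs):
    """Build a drug -> genes index so a prescription can be checked in one lookup."""
    index = {}
    for gene, drugs in gene_to_drugs.items():
        for drug in drugs:
            index.setdefault(drug, []).append(gene)
    return index


DRUG_TO_GENES = invert(GENE_TO_DRUGS)


def relevant_genes(drug):
    """Return the panel genes whose stored results bear on the given drug."""
    return DRUG_TO_GENES.get(drug.lower(), [])


print(relevant_genes("Warfarin"))
print(relevant_genes("phenytoin"))
```

In a reactive model, each of these lookups would instead trigger a new test order; in the pre-emptive model, the results for every gene in the panel are already on file when the drug is first considered.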


Clinical implementation 

A number of barriers prevent the widespread use of pre-emptive mul- 
tigene panels to guide the prescription of drugs. These include the lack 
of incentives for clinicians to conduct tests or implement procedures 
that might prevent adverse events. There are relatively few studies that 
prove the cost-effectiveness of pharmacogenetic testing”. Although 
a multigene panel approach is less expensive than ordering tests for 
one pharmacogene at a time, there are no data to assess the cost-effec- 
tiveness of the panel approach when implemented early on in life and 
used throughout a patient's lifetime. Many health-care systems do not 
provide financial reimbursement for preventive-medicine services or 
for pre-emptive screening, which creates a barrier to pharmacogenetic 
testing in the clinic?™. 

The cost and complexity of the computational approaches needed to 
identify, catalogue, prioritize and interpret genetic variants that influ- 
ence prescribing decisions present another barrier to the clinical uptake 
of pharmacogenomics testing. Although an increasing number of com- 
putational tools for analysing genetic variation are becoming available, a 
substantial level of expertise and manual interpretation is still required 
to apply this analysis successfully in the clinic (Fig. 3). Computational 
tools for CDS, triggered by patient-specific alerts, prompt and guide cli- 
nicians to use genetic information when prescribing affected drugs”. 
The costs associated with the use of pharmacogenomics in clinical prac- 
tice are quickly shifting away from laboratory testing towards the pro- 
cess of linking genetic-test results with evidence-based decisions that 
will robustly guide the prescription of medications and will be updated 
as the latest evidence emerges. However, it is unclear who should take 
responsibility for updating and paying for such interpretations. 

A further barrier to the clinical uptake of pharmacogenomic test- 
ing is the lack of clear guidelines for translating genetic variation into 
actionable recommendations. Professional societies and other guide- 
line-generating groups sometimes disagree on whether to proceed with 
pharmacogenetic testing and, if so, how. Examples for which there has 
been disagreement include the drugs warfarin” and clopidogrel™. A 
common reason for the lack of support for genetic testing is the dearth 
of randomized prospective controlled trials that compare genetically 
guided testing with conventional therapy. Also, many professional 
societies and guideline-generating groups approach their evaluation 
of pharmacogenomic tests from the standpoint of whether the clinician 
is obligated to order the genetic test”. However, as inexpensive multigene 
tests become available, the question is shifting from whether to 
order a genetic test to how existing genetic-test results can and should be 
used to influence prescribing decisions. For inherited genetic variation, 
the CPIC has undertaken the task of creating guidelines that focus on 
how genetic-test results should be translated into specific prescribing 
actions. The Royal Dutch Pharmacists Association took a similar 
approach”””'. Multiple resources exist to guide the selection of cancer 
drugs on the basis of somatically acquired genetic variation (Table 2), 
although these constantly change as new evidence arises”. 

As deep sequencing becomes more widespread, further variants 
will be discovered in pharmacogenes”. The challenge will be to cata- 
logue and annotate these variants. Given the importance of rare vari- 
ants for both inherited” and cancer-related pharmacogenes, publicly 
available and easily updatable resources such as PharmGKB, ClinGen 
and ClinVar will be essential for providing the computational CDS in 
health-care record systems with up-to-date recommendations that are 
based on genetic-test results””. Current heterogeneity in genetic-var- 
iation databases and health-care record systems, coupled with a lack 
of a common ontology, limits interoperability and hinders the use of 
pharmacogenetic test results longitudinally as well as across each of the 
health-care systems the patient must navigate. Several groups are work- 
ing to standardize the language of pharmacogenetic testing”, with 
the aim of creating terminology that can drive CDS across health-care 
record systems. Initiatives such as an Institute of Medicine Roundtable 
on Translating Genomic-Based Research for Health and the CPIC are 
working to create terms and language that can be directly uploaded into 
the CDS of electronic health records. However, heterogeneity across 
systems will initially slow the creation and uptake of CDS that facilitates 
the use of pharmacogenetic information™. 

Figure 3 | Bringing pharmacogenomic testing to the clinic. Multiple steps 
are required before pharmacogenomic testing can be used in the clinic. First, 
genes or drugs for actionability are identified. Guidelines for all actionable 
inherited pharmacogenes, such as those developed by CPIC”>, exist for 
subsequent steps that are shaded in red. Next, genotypes are assigned to alleles, 
and diplotypes are assigned to patients. Diplotypes are then translated to 
phenotypes (that describe gene function), followed by their interpretation with 
respect to drug therapy. Appropriate CDS is deployed to provide clinicians 
with recommendations for prescribing medications. Pharmacogenetic 
considerations are harmonized with other policies for affected medications, 
including therapeutic drug monitoring, if applicable. Clinicians and patients 
are educated as to what the results mean, what type of CDS and information 
to expect and which medication might be affected. Institutional oversight 
committees can then approve prescribing recommendations and policies, if 
desired. Last, clinical and prescribing outcomes are audited by various groups 
to evaluate the impact of the clinical implementation of pharmacogenomics. 
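
The middle steps described in Figure 3 (assigning a diplotype, translating it to a phenotype and surfacing a patient-specific alert) can be sketched in a few lines. The example below is a simplified illustration under stated assumptions: the allele-function table is a small hypothetical subset for TPMT, the phenotype labels are coarse, and the alert text is a placeholder and not a CPIC recommendation.

```python
from dataclasses import dataclass

# Assumed, illustrative allele-function assignments for TPMT.
# A real system would load these from a curated, regularly updated source.
ALLELE_FUNCTION = {"*1": "normal", "*2": "none", "*3A": "none", "*3C": "none"}

PHENOTYPE_BY_NONFUNCTIONAL_COUNT = {
    0: "normal metabolizer",
    1: "intermediate metabolizer",
    2: "poor metabolizer",
}


def diplotype_to_phenotype(allele_a, allele_b):
    """Translate a diplotype (two star alleles) into a coarse phenotype."""
    for allele in (allele_a, allele_b):
        if allele not in ALLELE_FUNCTION:
            raise ValueError(f"No function assigned to allele {allele}")
    nonfunctional = sum(ALLELE_FUNCTION[a] == "none" for a in (allele_a, allele_b))
    return PHENOTYPE_BY_NONFUNCTIONAL_COUNT[nonfunctional]


@dataclass
class Alert:
    gene: str
    diplotype: str
    phenotype: str
    message: str


def cds_alert(gene, allele_a, allele_b, drug):
    """Return an Alert when the phenotype is not normal, otherwise None."""
    phenotype = diplotype_to_phenotype(allele_a, allele_b)
    if phenotype == "normal metabolizer":
        return None
    message = (
        f"{gene} {phenotype}: review current guideline before prescribing {drug}."
    )
    return Alert(gene, f"{allele_a}/{allele_b}", phenotype, message)


print(cds_alert("TPMT", "*1", "*3A", "mercaptopurine"))
```

Keeping the allele-function table separate from the translation logic reflects the point made above: the laboratory result is stable, whereas the interpretation must be updated as evidence emerges.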

Other barriers include the insatiable desire for more evidence, a lack 
of education among clinicians, the scarcity of evidence-based implementation 
systems, and concerns about incidental or secondary findings 
from genetic testing, not to mention inertia in health-care systems”**. 
These barriers are not unique to pharmacogenomics, and the energy 
required to overcome them is likely to come from multiple sources that 
range from the ‘push’ of patient advocates to the ‘pull’ of the courtroom. 
The resistance by some clinicians to the ordering or use of pharma- 
cogenetic testing can be perplexing to patients and their caregivers. 
For example, an advocate for paediatric patients expressed a disquiet- 
ing lay perspective: “I am mystified by the resistance to a simple blood 
test that might save children’s lives.””” As the general public becomes 
more aware of the potential for genetic tests to improve the prescrip- 
tion of medications, including through direct-to-consumer testing”, 
its advocacy could grow even stronger. In Hawaii, the attorney general 
brought a lawsuit against the manufacturer of clopidogrel because it 
marketed its drug in the state without warning that a high percentage of 
the Hawaiian population has inherited low-function alleles of the gene 
CYP2C19, which encodes the enzyme required to convert clopidogrel 
to its active metabolite”. The case asserts that it was already known that 
CYP2C19 variant allele frequencies are higher in East Asian and Pacific 
Island populations, which comprise about 40% and 10% of the Hawaiian 
population, respectively. It also states that there was abundant evidence 
to show that the antithrombotic effects of clopidogrel are diminished in 
patients with low CYP2C19 activity (predisposing them to an increased 
incidence of cardiovascular events such as stent thrombosis)”*”*. From 
an educational perspective, multiple accrediting agencies are calling 
for pharmacogenomics to become part of the curricula for health-care 
students, trainees and advanced practitioners”. The availability of phar- 
macogenomic educational tools continues to grow”. Although early 
adopters of clinical pharmacogenomics are now establishing methods 
to advance the treatment of patients, broad clinical implementation 
remains elusive. 


Facilitating clinical use 

Around the world, many groups are working to share resources that 
will facilitate the clinical implementation of germline pharmacogenetic 
tests”. The European Pharmacogenetics Implementation Consortium 
is an international body whose goal is to improve therapy by integrating 
pharmacogenetic information into clinical care’. The Royal Dutch 
Pharmacists Association has also made efforts to facilitate implementa- 
tion’®”. In the United States, members of the NIH Pharmacogenom- 
ics Research Network organized the Translational Pharmacogenetics 
Project” °°"! which is dedicated to sharing best practices for the 
clinical implementation of CPIC pharmacogenomics guidelines. Mean- 
while, the Electronic Medical Records and Genomics (EMERGE) and 
Implementing Genomics in Practice (IGNITE) networks are testing 
pharmacogenetic implementation strategies**'**’°*. In Thailand and 
Singapore, where the HLA-B*15:02 variant is widespread and strongly 
predisposes patients to severe skin toxicity after treatment with spe- 
cific drugs, pharmacogenetic testing is common’****. The Genomic 
Medicine Alliance’ facilitates the clinical use of pharmacogenomics 
and has created a database that links drugs with genes''’. Population 
admixture between previously isolated, diverse human populations 
must also be considered as part of the global effort towards clinical 
implementation’. 


Future directions 

Clinicians are accustomed to making prescribing decisions on the 
basis of patient characteristics such as age, kidney or liver function, 
drug-drug interactions and personal preferences. Much of this takes 
place without optimal CDS to assist in compiling these characteristics 
and matching them with evidence-based choices on medications 
and doses. As CDS improves and becomes more widespread, and as 
the evidence to support pharmacogenomic testing continues to grow, 
momentum for the clinical implementation of pharmacogenomics 
should accelerate. Going forward, there is a growing body of evidence 
to suggest that pharmacogenomics will become an important component 
of evidence-based precision medicine. 

Gene therapy returns to centre stage 


Luigi Naldini1,2 


Recent clinical trials of gene therapy have shown remarkable therapeutic benefits and an excellent safety record. They provide 
evidence for the long-sought promise of gene therapy to deliver ‘cures’ for some otherwise terminal or severely disabling 
conditions. Behind these advances lie improved vector designs that enable the safe delivery of therapeutic genes to specific 
cells. Technologies for editing genes and correcting inherited mutations, the engagement of stem cells to regenerate tissues 
and the effective exploitation of powerful immune responses to fight cancer are also contributing to the revitalization of 
gene therapy. 


Gene therapy has long fascinated scientists, clinicians and the 
general public because of its potential to treat a disease at its 
genetic roots. This is achieved by counteracting or replacing 
a malfunctioning gene within the cells adversely affected by the 
condition. As simple as the concept sounds, the hurdles to put it into 
practice are daunting. Gene transfer must overcome complex cellular 
and tissue barriers to deliver new genetic information into the target 
cell to drive proficient expression of a therapeutic molecule without 
disrupting essential regulatory mechanisms. The gene-corrected cells 
must be present in large enough quantities to reverse the condition, 
escape immunological recognition and survive in the long term, or 
be able to transmit the modification to their progeny to sustain the 
benefit. Several gene-therapy trials have been performed in the past 
two decades for inherited diseases, cancer and chronic infections, but 
only a few reported clear clinical benefits and in some, individuals 
experienced severe adverse events related to the vectors. Overall, con- 
cern and scepticism rose over the further deployment of these strate- 
gies. But these attitudes are radically changing. A number of phase I/ 
II gene-therapy clinical trials have reported remarkable evidence of 
efficacy and safety for the treatment of various severe inherited dis- 
eases of the blood, immune and nervous systems, including primary 
immunodeficiencies, leukodystrophies, thalassaemia, haemophilia and 
retinal dystrophy, as well as cancers such as B-cell malignancies (Table 
1). All of these trials exploit improved vector technologies to deliver 
therapeutic genes. In some trials, genetic material is transferred into 
haematopoietic stem cells (HSCs) or T lymphocytes (T cells) ex vivo, 
and in others hepatocytes in the liver or photoreceptors in the retina 
are targeted directly in vivo. Here I review the most relevant clinical 
results, highlight progress in gene-transfer technologies and in our 
understanding of the biological processes that underpin these advances 
and discuss the challenges and outlook for gene therapy. 


Haematopoietic-stem-cell gene therapy 

HSCs have long been a preferred target for ex vivo gene therapy’. Genetic 
modification of self-maintaining multipotent HSCs would ensure a 
steady supply of their gene-corrected progeny in the body. These cells 
have the potential to treat conditions that manifest when mature hae- 
matopoietic lineages fail to develop or to function correctly. Given the 
self-renewing nature of HSCs and the need to ensure that genetic modifications 
are passed on to their progeny, gene correction must be stably 
introduced into cellular chromatin, either by vector-mediated transgene 
insertion or by in situ gene editing. 

Past trials of HSC gene therapy report clear benefits to people with 
selected conditions, which proved the therapeutic potential of the strat- 
egy’ '°. However, they also highlight the limitations and risks of using 
early generation vectors that are based on gammaretroviruses (γ-RVs)'"”’. 
For example, these techniques offered only a limited ability to transfer 
genes into the most primitive progenitor cells, giving rise to the low-level 
and transient appearance of gene-corrected haematopoietic cells in vivo. 
Leukaemia related to vector insertion near oncogenes also occurred in a 
fraction of patients during their long-term follow-up. The development 
of vectors with improved efficacy and safety in preclinical models, such 
as lentiviral vectors, has renewed interest in the approach. 

HSC gene therapies that incorporate these new vectors have been tested 
in severe inherited diseases of the immune system (Wiskott–Aldrich 
syndrome (WAS) and X-linked severe combined immunodeficiency 
(SCID-X1)) and blood (β-thalassaemia) and in neurodegenerative 
storage diseases (adrenoleukodystrophy and metachromatic leukodystrophy) 
(see Table 1). In children with SCID-X1, the development 
and function of the immune system is impaired owing to deficiencies 
in the receptors for certain cytokines that are essential for immune-cell 
development, whereas those with WAS have deficiencies in a cytoskeletal 
adaptor that is required for assembling the immunological synapse. 
Consequently, these children succumb to infection or, in the case of WAS, 
haemorrhage because of an accompanying platelet deficiency. People with 
β-thalassaemia major, the most severe form of β-thalassaemia, fail to 
express the haemoglobin-β chain, which leads to ineffective erythropoiesis 
and severe anaemia that requires frequent blood transfusions and an 
iron-chelation regimen. Children with early onset adrenoleukodystrophy 
or metachromatic leukodystrophy are affected by defective myelination 
and consequently experience glial- and neural-cell degeneration in the 
central and peripheral nervous systems. They are unable to break down 
some of the metabolites of myelin because of a deficiency in the peroxisomal 
ATP-binding cassette transporter in adrenoleukodystrophy, 
or in the lysosomal enzyme arylsulfatase A in metachromatic leukodystrophy. 
Such patients undergo rapid and irreversible deterioration of 
their motor, sensory and cognitive performance, which leads to death 
within a few years. HSC transplantation, in which HSCs are transferred 
from a healthy donor to a recipient with a specific condition, can virtually 
cure SCID-X1, WAS and β-thalassaemia. HSC transplantation can also 


1San Raffaele Telethon Institute for Gene Therapy (TIGET), San Raffaele Scientific Institute, 20132 Milan, Italy. 2Vita Salute San Raffaele University, 20132 Milan, Italy. 
Table 1 | Gene-therapy clinical trials highlighted in this Review 

| Disease | Vector and strategy | Number of patients* | Follow-up (months) | Patient status and biological and clinical outcomes* | Clinical-trial identifier | References |
|---|---|---|---|---|---|---|
| HSC-based gene therapy | | | | | | |
| Wiskott–Aldrich syndrome | Lentiviral vector; ex vivo gene transfer into CD34+ cells. | 7 | 10 to 60 | All patients AAW; stable engraftment with transduced cells; persistent clinical benefit and safety. | NCT01515462 | 16 and L.N.† |
| Wiskott–Aldrich syndrome | Lentiviral vector; ex vivo gene transfer into CD34+ cells. | 7 | 9 to 42 | 6 patients AAW, 1 patient died of a pre-existing infection; stable engraftment with transduced cells; persistent clinical benefit and safety. | NCT01347242, NCT01347346, NCT02333760 | 17 |
| X-linked severe combined immunodeficiency | Self-inactivating γ-RV; ex vivo gene transfer into CD34+ cells. | 9 | 12 to 39 | 8 patients AAW, 1 patient died of an infection; stable engraftment with transduced cells; persistent clinical benefit and safety in 7 patients; 1 patient failed to engraft and underwent HSC transplantation. | NCT01410019, NCT01175239, NCT01129544 | 18 |
| β-Thalassaemia major | Lentiviral vector; ex vivo gene transfer into CD34+ cells. | 3‡ | 24 to 72 | 2 patients stably engrafted with transduced cells, 1 patient became transfusion independent; 1 patient failed to engraft and received rescue cells. | N/A | 19, M. Cavazzana and BlueBird Bio† |
| β-Thalassaemia major | Lentiviral vector; ex vivo gene transfer into CD34+ cells. | 2‡ | 15 | Stable engraftment with transduced cells; transfusion independence and safety in both patients. | NCT02151526 | M. Cavazzana and BlueBird Bio† |
| β-Thalassaemia major | Lentiviral vector; ex vivo gene transfer into CD34+ cells. | 5§ | 1 to 6 | Stable engraftment with transduced cells; safety and transfusion independence in the first 2 evaluable patients. | NCT01745120 | BlueBird Bio† |
| Adrenoleukodystrophy | Lentiviral vector; ex vivo gene transfer into CD34+ cells. | 4 | 54 to 101 | Stable engraftment with transduced cells and safety in all patients; persistent clinical benefit in 3 patients. | N/A | 20, 21 and P. Aubourg† |
| Metachromatic leukodystrophy | Lentiviral vector; ex vivo gene transfer into CD34+ cells. | 20 | 3 to 60 | Stable engraftment with transduced cells and safety in all patients; persistent clinical benefit in all late-infantile patients who were treated when presymptomatic. | NCT01560182 | 22 and L.N.† |
| Liver-directed gene therapy | | | | | | |
| Haemophilia B | AAV8 vector; intravenous administration. | 10 | 16 to 48 | No inhibitors; persistent FIX expression; in high-dose group, mean FIX levels of 5.1 ± 1.7% seen in all 6 treated patients. | NCT00979238 | 40 |
| Haemophilia B | AAV8 vector; intravenous administration. | 7 | Up to 12 | No inhibitors; persistent FIX expression in 1 patient. | NCT01687608 | 110 |
| T-cell immunotherapy for cancer | | | | | | |
| B-cell lymphoma or CLL | γ-RV; ex vivo gene transfer into T cells; CAR-modified anti-CD19 cells. | 15 | 1 to 23 | 8 CRs, 4 PRs; ORR 80%. | NCT00924326 | 51 |
| B-cell ALL | γ-RV; ex vivo gene transfer into T cells; CAR-modified anti-CD19 cells. | 5 | 1 to 4 | 5 CRs; ORR 100%; 4 patients subsequently underwent allo-HSC transplantation as per clinical-study design, 1 patient was ineligible for HSC transplantation and relapsed. | NCT01044069 | 52 |
| B-cell ALL | Lentiviral vector; ex vivo gene transfer into T cells; CAR-modified anti-CD19 cells. | 30 | 1 to 24¶ | 27 CRs; ORR 90%; 19 patients remained in remission, 3 of these patients underwent HSC transplantation; 7 patients relapsed, 3 of these relapses occurred after loss of transduced T cells. | NCT01626495, NCT01029366 | 53 |
| B-cell ALL or lymphoma | γ-RV; ex vivo gene transfer into T cells; CAR-modified anti-CD19 cells. | 21 | Median = 10# | 14 CRs (at day 28); ORR 67%; 10 patients subsequently underwent HSC transplantation. | NCT01593696 | 54 |
| Retinal gene therapy | | | | | | |
| Type 2 Leber congenital amaurosis | AAV2 vector; unilateral subretinal administration. | 5 | 36 | In all 5 patients, stable improvement in visual sensitivity seen. | NCT00516477 | 94, 97 |
| Type 2 Leber congenital amaurosis | AAV2 vector; unilateral subretinal administration. | 3 | 54 to 72 | In all 3 patients, improvement in visual sensitivity seen at 6 months, which increased for 1 to 3 years and then declined. | NCT00481546 | 95 |
| Type 2 Leber congenital amaurosis | AAV2 vector; unilateral subretinal administration. | 12 | 36 | In 6 patients, improvement in visual sensitivity seen, which peaked at 6 to 12 months and then declined. | NCT00643747 | 96 |

AAW, alive and well; ALL, acute lymphocytic leukaemia; CLL, chronic lymphocytic leukaemia; CR, complete response; N/A, not applicable; ORR, overall response rate; PR, partial response. 
*As stated in the referenced publications or updated by personal communication. †Personal communication. ‡β°/β* genotype. §2 β°/β, 2 β°/β°, 1 β°/β* genotypes. ¶Median = 7. #51 days to HSC transplantation median. 
arrest the progression of leukodystrophies if performed before or near the 
time of onset, although the outcome of the procedure is more satisfactory 
in adrenoleukodystrophy than in metachromatic leukodystrophy. 
Allogeneic HSC transplantation confers a considerable risk of morbidity 
and death, particularly when performed between human leukocyte anti- 
gen (HLA)-mismatched individuals. Even in the few cases in which an 
HLA-matched family donor can be found, the risk of morbidity remains 
substantial. HSC gene therapy can address this unmet medical need, 
especially when no matched HSC donor is available. 

HSC gene therapy is administered by ex vivo gene transfer into haema- 
topoietic progenitors. First, the cells are purified using the CD34 surface 
marker from other leukocytes harvested from the bone marrow or mobi- 
lized peripheral blood of the recipient. Next, they are cultured for 2-4 days 
in the presence of growth-stimulating cytokines while being exposed to 
vectors carrying an expression cassette for the corrective transgene. Before 
the modified cells can be administered, the recipient is treated with a 
preconditioning chemotherapy regimen. This depletes endogenous progenitors 
and differentiated cells in the bone marrow (as well as the lymphoid 
organs, in some cases) and favours engraftment of the ex vivo 
gene-corrected cells. Preconditioning results in considerable early morbidity 
owing to transitory blood-cell depletion, immunodeficiency and mucosal 
damage, which place the recipient at risk of severe infection. It also causes 
delayed morbidity owing to the risk of developing chemotherapy-induced 
secondary tumours and infertility. Several HSC gene-therapy trials have 
attempted to alleviate the morbidity associated with preconditioning by 
lowering the dosage and combination of chemotherapeutic drugs that 
are administered in comparison to the regimens used in conventional 
HSC transplantation. However, the impact of changing these drug regi- 
mens on the risks and benefits of the therapy is yet to be determined 
in broader comparative studies and through long-term patient follow- 
up. Some preconditioning regimens also deplete endogenous microglia 
progenitors, which live in the central nervous system (CNS). In patients 
with leukodystrophies who have been treated with HSC gene therapy, 
these cells are replaced with their gene-corrected counterparts, which 
migrate to CNS tissues and produce functional progeny that can clear 
stored myelin metabolites. In the case of metachromatic leukodystro- 
phy, these modified cells also release functional lysosomal enzyme for 
the cross-correction of other tissue-resident cells. 

All HSC gene-therapy trials performed with lentiviral vectors report 
stable and high-level reconstitution of haematopoiesis with gene-cor- 
rected cells in most recipients, with the most recent trials observing 
up to 90% reconstitution in some patients. Because all of the 
haematopoietic lineages tested (myelo-monocytic, megakaryocytic, 
erythroid, natural killer (NK), and B lymphoid and T lymphoid) contain 
gene-corrected cells and appear to receive a steady input of newly formed 
modified cells — most evident in the short-lived myeloid lineages — gene 
transfer must have occurred in self-renewing, multipotent progenitors 
that effectively engrafted to the recipients. The extent of haematopoietic 
reconstitution with gene-corrected cells varies among the patients and 
trials. This reflects the impact of the disease and its correction on the 
different haematopoietic lineages as well as the potency of each vector 
batch and the type of preconditioning regimen administered. Remark- 
ably, most patients gain substantial benefits from gene therapy that can 
even exceed those observed after successful HSC transplantation from an 
allogeneic donor. This is especially true for a metachromatic leukodystrophy 
trial based on the rationale that gene transfer could drive higher 
lysosomal enzyme expression in haematopoietic cells than in normal cells, 
which would enable these cells to provide an increased supply of enzyme 
while homing in to affected tissues. The strategy allowed the unsatisfac- 
tory outcome of HSC transplantation to be overcome in metachromatic 
leukodystrophy. Despite an increasing number of patients being treated 
in these HSC gene-therapy trials (see Table 1), there has been no report of 
adverse events related to lentiviral vectors. At the most recent follow-up 
of patients included in these trials, the first of whom was treated almost 
seven years ago in the adrenoleukodystrophy trial, those who had successfully 
engrafted with modified cells were reported to be in a good 
condition (that is, they are either disease free or their disease has been 
stable or in remission since the time of treatment) and are leading 
near-normal lives. These individuals would probably have already succumbed 
to their disease if untreated with gene therapy. 


Liver-directed gene therapy 

The liver has long been a preferred target for in vivo gene therapy. This 
major internal organ and central metabolic hub receives an abundant 
blood supply through an extensive bed of sinusoids with highly perme- 
able walls — a structure that facilitates the access of blood-borne particles, 
such as viruses, to hepatocytes. Hepatocytes are long-lived and robust 
protein factories that can efficiently release their products into the blood 
circulation. Stable transgene delivery to the liver could therefore provide 
a strategy for treating several inherited metabolic diseases and plasma- 
protein deficiencies, notably those of coagulation factors. Major hurdles 
to liver-directed gene therapy include the potential toxicity of an acute 
inflammatory response to the bolus administration of viral particles into 
the bloodstream, and the inactivation of these particles by pre-existing 
virus-specific antibodies and clearance by phagocytes that line the sinu- 
soidal walls of the liver and spleen. If humoral or cellular immunity is 
triggered against the transgene product, the therapeutic activity of gene 
therapy could be inhibited in the circulation or the modified cells might 
be eliminated by cytotoxic T cells, respectively. 

Liver-directed gene therapy has been tested mainly in the treatment of 
severe haemophilia B using vectors derived from the human parvovirus, 
adeno-associated virus (AAV). On intravascular injection, some AAV-vector 
serotypes effectively target hepatocytes and remain stably associated 
with the modified cells, mostly as nonintegrated episomes within 
the nucleus. In animal models, this delivery strategy does not induce an 
immune response against the transgene product and instead favours the 
development of transgene-specific tolerance. The persistence of AAV-vector 
genomes that contain an expression cassette for coagulation factor 
IX (FIX) — a protein that is absent or defective in haemophilia B — within 
nonproliferating hepatocytes can give rise to a stable supply of functional 
FIX into the bloodstream. FIX expression as low as 1% of the normal 
level can turn severe haemophilia B into a milder form of disease. This 
partial reconstitution alleviates the risk of spontaneous haemorrhages, 
which are detrimental to the joints and potentially lethal if they occur in 
the brain. It can also reduce the need for intravenous prophylactic fac- 
tor-replacement therapy, a treatment that imposes a substantial burden 
on patients and is only available at high cost and in countries with 
well-developed health-care systems. Although earlier trials demonstrated 
the safety of delivering AAV vectors into the human bloodstream, they 
reported only limited and transient FIX expression. This is probably 
because of low levels of gene transduction in the liver and a delayed cel- 
lular immune response that targeted AAV components persisting within 
modified hepatocytes and triggered their elimination. More recently, 
a trial that used a new AAV serotype (AAV8) and an improved cassette 
design was able to overcome these limitations to demonstrate 
vector-dose-dependent FIX expression of up to 6% of the normal level after a 
single, well-tolerated vector infusion in a peripheral vein. Three years later, 
FIX expression remained stable in most patients, although some had to be 
given a transient immunosuppressive corticosteroid treatment at the first 
sign of hepatocellular injury in the 4-8 weeks following treatment. Most 
treated patients were able to reduce or abrogate the need for prophylactic 
factor replacement, which substantially improved their quality of life. 


T-cell immunotherapy for cancer 

T cells are also popular targets for ex vivo gene therapy. Such therapies 
aim mostly at boosting the adaptive immune response against cancer and 
chronic infections such as HIV. Autologous T cells can be harvested 
readily from the peripheral blood and expanded ex vivo. Cells are then 
transduced with a γ-RV or lentiviral vector expressing an exogenous T-cell 
antigen receptor (TCR) that is specific to a cancer-associated antigen or 
an antiviral molecule, and infused back into the patient. The use of T cells 
for cancer immunotherapy arose from seminal observations of objective 
BOX 1 
Lentiviral vectors 


Major hurdles for haematopoietic-stem-cell (HSC) gene therapy 
include achieving efficient ex vivo gene transfer into long-term 
repopulating HSCs, preventing activation of oncogenes by the nearby 
integration of a vector and controlling transgene expression to avoid 
ectopic or constitutive expression that leads to toxicity. As compared 
to early generation gammaretroviral vectors (γ-RVs), HIV-derived 
lentiviral vectors result in more efficient gene transfer and stable, 
robust transgene expression in HSCs and their multilineage progeny. 
Extensive preclinical work indicated important features in vector 
biology and design that affect genotoxicity and highlighted strategies 
to alleviate it. The self-inactivating long terminal repeats (LTRs) 
and integration-site preferences of lentiviral vectors were shown to 
substantially alleviate insertional genotoxicity. When tested in γ-RVs, 
the self-inactivating LTR design was shown to improve the safety of 
this platform as well. Retrospective analysis of several earlier trials 
suggests that disease background, transgene function, ex vivo culture 
and the efficiency of host repopulation can all influence the likelihood 
that insertional genotoxicity will manifest in a trial. These data 
helped to shape the ideas that not all integrating vectors have the 
same effects and that genome-wide integration of improved vector 
designs, although still a mutagenic event, can be tolerated in the 
absence of aggravating circumstances. 

Self-inactivating lentiviral vectors are also being used to engineer 
T cells with chimaeric antigen receptors (CARs) or T-cell antigen 
receptors for use in adoptive immunotherapy for the treatment 
of cancer. The advantages of this new platform in comparison to 
earlier-generation γ-RVs, which perform satisfactorily in this cell 
target, are yet to be fully established. Lentiviral vectors are thought 
to give rise to more robust and stable transgene expression in 
T cells in vivo, and could facilitate more efficient and versatile ex 
vivo gene transfer while supporting coordinated expression of 
multiple transgenes. These advantages will become more 
relevant as the gene-therapy field implements refined strategies, 
such as improved T-cell manipulation to preserve T memory 
stem cells, or more demanding cell-engineering tasks, such 
as the co-expression of multiple CARs (to improve specificity) or a 
conditional safety switch/suicide gene (to improve safety). 


tumour responses, which sometimes led to complete tumour regression, 
after the infusion of ex vivo-expanded autologous tumour-infiltrating 
lymphocytes in patients with advanced melanoma. The underlying 
mechanisms of these responses have been traced to the ex vivo activation 
and amplification — and subsequent in vivo persistence — of tumour- 
specific cytotoxic T cells that already exist in small numbers within some 
tumours but are suppressed by the local microenvironment. These data 
support the therapeutic potential of an adaptive immune response against 
some tumours when it is released from endogenous suppressive signals 
and the reactive T cells are infused at high numbers, which leads to a 
favourable effector-target ratio. Preconditioning patients with a lym- 
phoid-depleting regimen before infusion of the ex vivo-expanded T cells 
improves their engraftment and activity in vivo. 

The genetic transfer of an exogenous TCR to a T cell can bypass the 
need to find preexisting cancer specificities within the patient’s T-cell 
population. It also allows the exploitation of synthetic high-affinity 
TCRs, which might otherwise be purged in vivo by thymic selection, to 
redirect the specificity of autologous cells. Ideally, such TCRs would tar- 
get tumour-associated antigens from endogenous proteins that are not 
being expressed — at the same time or at detectable levels — by normal 
tissues, such as carcinoembryonic or cancer testis antigens. Alternatively, 
they could target tumour-driver mutations that are commonly found 
within a certain type of tumour and that cannot be lost, even to evade 
immune clearance. Recent studies, however, indicate that most sponta- 
neous or elicited tumour-specific immune reactivity is instead directed 
against ‘passenger’ neoantigens, which uniquely originate from random 
mutations that accumulate within individual tumours. This type of 
immunotherapy must therefore be highly personalized and will require 
both the identification of candidate neoantigens from the tumour exome 
or proteome and the retrieval or generation of the cognate TCR recogni- 
tion sequences for ad hoc T-cell genetic engineering. 

Early clinical trials of T-cell gene transfer with TCRs directed against 
tumour-associated antigens reported partial tumour responses and, 
in some instances, off-tumour reactivity that led to tissue damage 
and severe adverse events. More recently, gene transfer was used 
to introduce synthetic chimaeric antigen receptors (CARs) to T cells. 
CARs combine the binding specificity of an antibody against a cancer- 
associated surface marker with one or more intracellular signalling 
domains from the TCR and costimulatory receptor complexes. The 
strategy holds a number of advantages over the use of conventional 
TCRs. For instance, antigen recognition is not restricted by HLA, and 
antibody specificities previously validated for safe and specific cancer 
recognition in vivo can be exploited by the incorporation of single- 
chain antibody derivatives into the CARs. Furthermore, these engi- 
neered T cells can be fully activated when meeting their target. Trials 
that deployed CARs directed against the B-cell surface molecule CD19 
reported dramatic benefits in patients with B-cell malignancies, who 
experienced durable clinical responses, including complete remission, 
with mostly manageable toxicity. These results have spurred enormous 
interest in further developing this approach. 
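
The overall response rates (ORRs) listed for the anti-CD19 CAR trials in Table 1 can be reproduced from the response counts in the same table. The short Python sketch below assumes that ORR is the fraction of treated patients with a complete or partial response, expressed as a rounded percentage, and that trials for which Table 1 lists no partial responses had none. The function name and data layout are illustrative only and are not taken from the cited trials.

```python
def overall_response_rate(complete: int, partial: int, treated: int) -> float:
    """Return the ORR as a percentage: (CRs + PRs) / patients treated."""
    if treated <= 0:
        raise ValueError("treated must be a positive number of patients")
    return 100.0 * (complete + partial) / treated


# (clinical-trial identifier, patients treated, CRs, PRs) as listed in Table 1.
# A PR count of 0 is an assumption where the table reports only CRs.
TRIALS = [
    ("NCT00924326", 15, 8, 4),
    ("NCT01044069", 5, 5, 0),
    ("NCT01626495", 30, 27, 0),
    ("NCT01593696", 21, 14, 0),
]

for trial_id, treated, cr, pr in TRIALS:
    rate = overall_response_rate(cr, pr, treated)
    print(f"{trial_id}: ORR {rate:.0f}%")
```

Under these assumptions the computed values agree with the percentages given in Table 1 (80%, 100%, 90% and 67%).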


Reasons for success in recent clinical trials 
The positive clinical outcomes discussed in this Review provide long- 
sought evidence for the elusive promise of gene therapy to deliver lasting 
therapeutic benefit, or even a ‘cure’, for an otherwise terminal or severely 
disabling condition after a single treatment. These results could represent 
the rewarding outcome of the effort to rationally improve vectors in the 
laboratory, as discussed in Box 1 and Box 2. Advances in vector manu- 
facturing and characterization, leading to batches with higher potency 
and greater purity, could also be facilitating the increased transduction 
of target cells with lower adverse effects, even when using the same vec- 
tor design as in earlier trials. A better understanding of the biological 
processes and specific cell types that are involved in the success or fail- 
ure of each gene therapy has also helped to improve methods for ex vivo 
cell handling and led to more effective monitoring and management of 
patients. Indeed, increasing confidence in the positive outcome of recent 
trials is now built on both clinical observation and advanced molecular 
readouts that provide evidence for the efficacy and safety of gene therapy. 
In recent lentiviral-vector-based HSC gene-therapy trials, vector inser- 
tional analysis was used to track individual clones in the reconstituted 
haematopoiesis and showed that polyclonal reconstitution by transduced 
stem cells had taken place without the emergence of dominant clones 
whose behaviour could be attributed to gain-of-function vector inser- 
tional mutagenesis. An in-depth molecular analysis of the reconstituted 
haematopoiesis now being performed in different diseases, including 
several that were treated with different types of vector, is facilitating 
the first reliable comparative assessment of vector-induced events in 
patients. The emerging picture illustrates that the absence of 
adverse clinical events in recent trials is accompanied by a remarkably 
different landscape of vector-insertion distribution in patients. In the 
earlier trials, which were performed with long terminal repeat (LTR)-competent 
γ-RV vectors, there was frequent and early generation of 
dominant clones, whose expansion appeared to be driven by the altered 
expression of cancer genes that were targeted by vector insertion. 
However, this finding is rarely observed when newer vectors, such as 
lentiviral vectors, with self-inactivating LTRs and different insertion-site 
preferences are used. In parallel, retrospective reconstruction of 
multistep leukaemogenesis from the earlier trials supports the view that 
vector insertion provided a first mutagenic hit that potentially led to the 
development of leukaemia. Progression of the disease was facilitated by 
accumulation of further vector-independent mutations in the expand- 
ing clone, which were favoured by concurrent precipitating factors such 
as oligoclonal reconstitution, selective pressure for a survival or growth 
advantage, transgene toxicity or stressed haematopoiesis resulting from 
the underlying disease. Overall, these findings support the idea 
that improved efficiency of HSC transduction and transplantation, 
together with improved vector design, can substantially alleviate, but 
not eliminate, concerns about genotoxicity. Once it has been established 
that vector insertion is mostly neutral to cell behaviour, tracking clonal 
activity in the reconstituted haematopoiesis can highlight important 
biological features of haematopoietic regeneration. These include the 
pattern of activity and stability of the transduced HSCs, which provides 
the foundation for predicting long-term maintenance of the therapeutic 
benefit. Tracking clones can also help to establish the reliability of 
current models of lineage relationship and haematopoietic hierarchy 
based on xenotransplantation studies. 
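
The clonal tracking described above can be illustrated with a small calculation. The Python sketch below uses hypothetical insertion-site read counts to estimate the relative abundance of each clone, flag clones that exceed a dominance threshold and summarize polyclonality with the Shannon diversity index. The site labels, the counts and the 20% threshold are assumptions made for illustration; they are not values or methods taken from the trials discussed in this Review.

```python
import math
from typing import Dict, List


def clone_fractions(read_counts: Dict[str, int]) -> Dict[str, float]:
    """Convert per-insertion-site read counts into relative abundances."""
    total = sum(read_counts.values())
    if total <= 0:
        raise ValueError("at least one read is required")
    return {site: count / total for site, count in read_counts.items()}


def dominant_clones(fractions: Dict[str, float], threshold: float = 0.20) -> List[str]:
    """List insertion sites whose abundance exceeds the (assumed) threshold."""
    return sorted(site for site, frac in fractions.items() if frac > threshold)


def shannon_index(fractions: Dict[str, float]) -> float:
    """Shannon diversity (natural log); higher values indicate a more polyclonal sample."""
    return -sum(frac * math.log(frac) for frac in fractions.values() if frac > 0)


# Hypothetical samples: ten equally represented clones versus one expanded clone.
polyclonal = {f"site_{i}": 100 for i in range(10)}
skewed = {"site_A": 900, "site_B": 50, "site_C": 50}

for label, sample in (("polyclonal", polyclonal), ("skewed", skewed)):
    fractions = clone_fractions(sample)
    print(label, dominant_clones(fractions), round(shannon_index(fractions), 3))
```

In this toy example the evenly distributed sample yields no flagged clone, whereas the skewed sample flags the single expanded insertion site. Real insertion-site analyses must also account for sampling depth and quantification bias, which this sketch ignores.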

In adoptive T-cell therapy, a subpopulation of T memory stem cells 
plays an important part in supporting long-term efficacy. This knowledge 
arose from an improved understanding and phenotypic characterization 
of T-cell differentiation pathways, and the identification of the cells that 
are responsible for the sustained generation of effector activity in vivo. 
Optimization of ex vivo cell culture and in vivo tracking of gene-marked 
T cells also contributed to this discovery, which should facilitate the 
design of more effective trials. Epitope mapping and tracking of cancer- 
targeting T cells in patients who are undergoing adoptive T-cell therapies 
and checkpoint blockade drug treatment provide direct evidence for the 
potential to evoke cancer-specific effector T cells in some types of cancer, 
which could then be exploited to clear advanced metastatic disease after 
ex vivo amplification or genetic editing. 

In liver-directed gene therapy, detailed epitope-specific immune moni- 
toring of antiviral T-cell responses on in vivo AAV gene delivery is able 
to account partially for earlier failures to observe long-term stable FIX 
expression. This also helped to uncover strategies for controlling a 
delayed cytotoxic adaptive response to viral components that persist 
within transduced cells, which include administering transient immunosuppressive 
drugs at the first occurrence of such a response and engineering 
viral capsids to bypass dominant responses. 

The enhanced efficacy observed in recent gene-therapy trials could 
also reflect improved study designs. These have been made possible by 
a deeper understanding of vector–host interactions and by the application 
of more rational patient-selection criteria, which are more easily 
adopted after the initial ‘proof of safety’ of administration by vector in 
humans has been obtained. Improved trial designs can enhance effi- 
cacy in several ways. For example, treating early symptomatic (or even 
presymptomatic) individuals instead of people with an advanced stage of 
a disease provides a greater chance for the therapy to work before tissue 
damage has become irreversible. Vector doses can be set at levels that 
approach therapeutic efficacy instead of minimal biological activity, as is 
usually done to alleviate risks when establishing the safety of a treatment 
in phase I/II trials. However, this cautionary approach often denies trial 
participants the chance to gain immediate benefit from the therapy, as 
well as any future benefit (because they become immune to the vector). 
By administering more aggressive preconditioning regimens, space can 
be made for the gene-corrected cells. Vector-induced adverse events can 
also be neutralized by preemptive or prompt pharmacological treatment, 
such as administering anti-inflammatory drugs or steroids to suppress 
innate or adaptive immune responses, respectively. 

Gene-therapy trials often address rare diseases for which little is 
known about the natural history and genotype-phenotype relation- 
ships. Enriching such knowledge helps in the design of more effective 
trials because it becomes possible to recruit the patients who are most 
likely to benefit from the treatment. Efficacy can also be established 
according to validated endpoints within a timeframe that is compatible 
with further development of the therapy. 
Challenges ahead and prospective developments 

In HSC gene therapy, despite advances in vector design, determining the 
actual risk of insertional mutagenesis in clinical trials — and how it can 
be managed at the level of the individual patient or overcome by further 
improvements in vector design — remain major targets for future pro- 
gress. Several experimental models have been developed to assess and 
rank the relative genotoxicity of different vector types and designs. Despite 
providing the rationale to advance new vectors to clinical testing, they fail 
to provide a quantitative prediction of actual oncogenic risk in the clini- 
cal setting. It is still difficult to establish the actual risk of genotoxicity for 
a given vector in a given trial, especially considering the low number of 
patients that have been treated with vectors to date, the longer follow-up 
periods that are required, and the possibility that the disease background 
could concur to increase the risk. In addition, it remains unclear whether 
molecular monitoring can predict the progression to malignancy of aber- 
rant clones in individual patients. As long as the chosen vectors are able 
to integrate throughout the genome, subject to preferences dictated by 
the parental virus and particle composition, the risk of gene activation 
and inactivation at insertion sites can be mitigated but not abrogated. 
Although the improved vector types and designs that are currently being 
tested could alleviate some of the concerns that surround oncogene acti- 
vation, which represents a high-risk dominant mutagenic event, the dis- 
ruption of tumour-suppressor genes remains possible. These less frequent 
and recessive events might reveal their consequences only after a longer 
latency period or through testing in large-scale trials. 

If efficacy and safety can be established in greater numbers of patients 
and over longer follow-up periods, HSC gene therapy might eventually 
challenge the dominance of allogeneic HSC transplantation as a first-line 
therapeutic option in monogenic diseases for which HSC transplantation 
is beneficial. In fact, the use of autologous HSCs makes the treatment


BOX 2
New adeno-associated virus vector serotypes

Major hurdles for liver-directed gene transfer include alleviating
the inflammatory response to intravenous administration of high-dose
viral particles and bypassing pre-existing immunity to viral
components. Hepatocyte targeting must also be improved and long-term
transgene expression should be established. In a recent
clinical trial of liver-directed haemophilia B gene therapy, an
adeno-associated virus (AAV) vector that had shown improved liver tropism
(provided by the AAV8 serotype) and enhanced transduction
efficiency (obtained by packaging a self-complementary genome)
in animal models resulted in the long-term expression of factor IX
(FIX) in most patients, as long as a transient immunosuppressive
corticosteroid treatment was promptly administered to those
who were developing signs of hepatocellular injury. Whether the
successful outcome can be attributed to the use of the new vector
design remains to be established because the dose-response
relationship and tropism of the different vector serotypes might
differ between humans and animal models. Another ongoing trial
of haemophilia B gene therapy is using the same AAV8 serotype to
express a hyperactive FIX transgene. Preliminary reports indicate
that sustained FIX expression occurs only in some patients, despite
the administration of oral corticosteroids on signs of hepatocellular
injury. It is possible that the immunosuppressive treatment was
administered too late to be effective at preventing the clearance
of modified hepatocytes. Alternatively, certain aspects of vector
design and manufacturing might be missed by current methods
of characterization, which could influence the robustness of
the therapy.

potentially available to all patients. HSC transplantation, meanwhile, is
available only to patients for whom an HLA-matched or compatible donor 
can be found. HSC gene therapy can also substantially lower morbidity 
because it abrogates the risk of graft-versus-host disease and abolishes 
the need for immune suppression after treatment. Moreover, it could per- 
mit the use of milder preconditioning treatments, as partial chimaerism 
through the presence of gene-corrected cells might be sufficient to correct 
the disease and spare the patients the risk of myelo- and lymphoablative 
preconditioning treatments. In addition, genetic engineering of HSCs 
might facilitate new modes of treatment if the present outcome of HSC 
transplantation is unsatisfactory. This could involve increasing the thera- 
peutic gene dosage to a level that is higher than the level that is provided 
by normal donor cells (as previously described for metachromatic leu- 
kodystrophy) or delivering transgene activity to selected tissues or sites of 
disease (such as the CNS in metachromatic leukodystrophy). HSCs and 
their progeny could also be equipped with exogenous genetic informa- 
tion to better fight cancer or chronic infection. As therapeutic options 
become available for otherwise incurable diseases, the genomes of new- 
born babies could be screened to allow early treatment of these conditions. 
To fulfil these predictions, ex vivo genetic modification and culturing of 
HSCs and their progenitors must become more robust and reliable. This 
should be coupled with the improved maintenance and expansion of the 
treated cells, which would support faster haematopoietic recovery in pre- 
conditioned patients and increase the clonal composition, resilience and 
stability of the engineered graft — thereby improving the short-term and 
long-term safety of the procedure. When therapeutic benefit is reached on 
establishing a partial chimaerism, less-toxic conditioning regimens based 
on biological agents could also be applied, which would spare patients 
both the acute and long-term toxicity of current chemotherapy-based 
regimens.

In T-cell immunotherapy for cancer, the number of CARs in clinical 
testing is growing quickly, although most of the trials focus on treating 
B-cell malignancies using CARs that target B-cell surface antigens. Nota- 
bly, this approach depletes not just the disease-causing malignant clones 
but also almost all B cells in the patient. Although the lack of B cells can 
be remedied by the infusion of immunoglobulins, depletion of other cell 
lineages might not be as manageable. This issue could limit the use of 
other lineage-specific antigens as CAR targets. In addition, the clearance 
of large tumour masses observed in these trials was accompanied by an 
acute and often severe syndrome — even requiring intensive care — that 
followed the massive release of cytokines from on-target activated T cells. 

Given the remarkable benefits that have been observed in some 
patients, further CARs are likely to be designed and tested. However,
important questions still need to be addressed. For example, what CAR 
design will achieve the greatest efficacy with minimal toxicity? How can 
the toxicities deriving from the CAR-mediated recognition of healthy 
tissues that express target antigens at low levels be controlled in a timely 
and effective way? Can adoptive T-cell therapies be as effective in solid 
cancers as they seem to be in some lymphoid malignancies, or might 
features of the tumour stroma inhibit their activity? Although more chal- 
lenging than transferring CAR genes, transferring TCR genes could be 
more effective in the long term. TCR gene transfer could also be a better 
fit for cancers that tend not to accumulate passenger mutations and might 
be less responsive to checkpoint blockade drugs. By targeting cancer-
driver mutations, TCR gene transfer might be less prone to the induction
of resistance than CAR-gene transfer, which has a more limited range of 
targets that must be surface antigens and are not always drivers of the 
disease. TCR gene editing can redirect the specificity of a T cell towards a
new antigen by disrupting endogenous TCR genes then introducing an 
exogenous TCR or CAR. It could therefore become a powerful strategy 
to avoid TCR mispairing, which occurs when the same cell expresses two 
different TCRs. Moreover, in driving the biological responses of the genet- 
ically modified T cells, it suppresses confounding endogenous TCR sig- 
nal transduction. When combined with growing evidence to support the 
substantial efficacy of immune checkpoint blockade therapy, the outcome 
of recent CAR T-cell-gene transfer trials suggests that immunotherapy 
could become a new pillar of cancer therapy that has the potential to
eradicate the disease.

In some of the liver-directed, AAV-based, gene-therapy trials discussed 
in this Review, the consequences of a delayed cell-mediated immune 
response that targets transduced hepatocytes could be controlled. However,
this remains an area of close scrutiny because the factors that might
combine to trigger the response and regulate its timing are not fully
understood. For instance, how does the occurrence of this delayed
hepatocellular toxicity relate to the administered vector dose, potency and type?
And is it related to possible concomitant triggers such as transient sub- 
clinical liver injury, or inflammation that recruits memory T cells at a 
time when viral antigen is still being presented by the transduced hepato- 
cytes? Another unresolved aspect concerns the durability of transgene 
expression over an extended period of time. Although AAV-mediated 
FIX expression remains stable over the life of treated dogs, it is difficult to 
predict whether this also applies to humans, who have longer lifespans. A 
clear understanding of the molecular forms that underlie the long-term 
association of AAV DNA with the hepatocyte nucleus — and their rela- 
tive contribution to transgene expression — will surely provide insight. In 
preclinical studies, a fraction of the AAV genome is reported to have inte- 
grated into the chromatin of hepatocytes, although it is unclear how much 
it could contribute to long-term gene expression. Although integration
ensures long-term expression of genes, even in the face of cell replication,
there are concerns over its safety. Approaches must be developed that
enable patients with high concentrations of AAV-neutralizing antibodies
to access AAV-mediated gene therapy and to allow re-administration of
gene therapy, possibly by exploiting alternative capsid composition.

As confidence grows in the choice of vector type and dosing, patient 
selection criteria can be used to target therapies to individuals who are less 
likely to mount AAV-directed immune responses. New vector serotypes 
will ensure that AAV-based gene therapy becomes more widely applicable. 
For instance, it should be possible to tackle haemophilia A — the most 
common and challenging form of haemophilia — as well as several other 
metabolic and storage diseases that affect the liver and other peripheral 
organs, including muscle and the heart. The remarkable ability of AAV 
gene therapy to establish a long-term clinical benefit through a simple, 
well-tolerated intravenous infusion, combined with the ease of manu- 
facturing a vector-based medicinal product (instead of a personalized 
cell product that is necessary for ex vivo gene therapy), could set the stage 
for its rapid commercial development and broad market distribution. 
AAV gene therapy might also be suitable for addressing unmet medical 
needs in countries with less well-developed health-care systems because a 
single intervention could alleviate the burden of providing and accessing 
lifelong replacement therapy and medical care. Other vector platforms, 
including lentiviral vectors, are also being developed for liver-directed 
gene therapy and could eventually complement AAV-based delivery to
broaden the access of patients to — and the diseases targeted by — this 
promising type of gene therapy. 


Targeted gene editing 
Another important boost to the gene therapy renaissance has emerged 
from advances in gene-targeting technologies. The ability to generate
artificial DNA endonucleases that bind specifically to a DNA sequence 
of choice and induce a double-strand break (DSB) is making targeted 
genome editing more efficient and much easier to undertake, which 
brings the long-sought goals of somatic gene disruption, targeted 
transgene integration and in situ gene correction within the reach of 
gene therapy. All of these strategies entail a ‘hit-and-run’ mechanism that
requires only transient expression of the nuclease complex and, in some 
cases, a repair template to modify the genome permanently. Because they 
target a selected region of the genome, these strategies abrogate the risk of 
insertional mutagenesis and poorly controlled transgene expression that 
is associated with conventional gene-replacement strategies. 

Gene targeting has recently entered clinical testing through the adoptive 
T-cell therapy of patients infected with HIV. Artificial zinc-finger nucleases
(ZFNs) were used to disrupt the gene that encodes CCR5, a cellular
co-receptor for HIV, in T cells grown ex vivo from each patient, with the
aim of making these cells resistant to infection with the virus before their 
reinfusion. Gene disruption was achieved by creating a DNA DSB in an 
exon of the CCR5 gene because repair of DSBs by the non-homologous 
end-joining (NHEJ) pathway leads to the loss or insertion of bases during 
end rejoining, which inactivates the coding sequence. The trial reported 
both proof of safety and long-term persistence of the engineered cells in 
vivo. It also provided an indication of efficacy through a trend for the posi- 
tive selection of cells with disrupted CCR5 alleles, as well as an observation 
in some patients of improved control of viral replication on the scheduled 
interruption of antiviral therapy. To achieve definitive viral clearance,
however, the fraction and long-term maintenance of infused cells bear- 
ing biallelic CCR5 disruption might need to be increased. This could be 
achieved by performing gene editing in self-renewing progenitor cells, 
such as HSCs or T memory stem cells, because these cells offer better
potential for achieving this level of enhanced and long-term reconstitu- 
tion of the T-cell compartment, especially if combined with precondition- 
ing to deplete endogenous non-modified cells. Targeted gene disruption 
can also be used to relieve the repression of an endogenous gene whose 
expression might compensate for the dysfunction of another gene. For 
instance, inactivation of the transcriptional repressor BCL11A in erythroid
progenitor cells could reactivate fetal γ-globin expression to compensate
for or counteract a dysfunctional β-globin chain in β-thalassaemia or
sickle-cell disease, respectively. However, a higher efficiency of biallelic
knockout is required to fulfil these goals when there is no mechanism by 
which to amplify selectively the genetically modified cells. 
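
The dependence on biallelic knockout efficiency can be illustrated with a short calculation. The sketch below is a simplified illustration rather than a model taken from the studies discussed: it assumes that the two alleles in a cell are disrupted independently and with equal probability, and that an NHEJ-induced insertion or deletion inactivates a coding sequence only when its length is not a multiple of three (a frameshift). The function names are hypothetical.

```python
def causes_frameshift(indel_length: int) -> bool:
    """Return True if an indel of this length (in bases) shifts the reading frame.

    Negative values denote deletions, positive values insertions.
    Simplifying assumption: in-frame indels are treated as harmless.
    """
    if indel_length == 0:
        raise ValueError("an indel must change the sequence length")
    return abs(indel_length) % 3 != 0


def biallelic_fraction(per_allele_efficiency: float) -> float:
    """Expected fraction of diploid cells with both alleles disrupted.

    Assumption: the two alleles are disrupted independently and with
    equal probability, so the fraction is the square of the efficiency.
    """
    if not 0.0 <= per_allele_efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    return per_allele_efficiency ** 2


if __name__ == "__main__":
    for efficiency in (0.3, 0.5, 0.8):
        fraction = biallelic_fraction(efficiency)
        print(f"per-allele efficiency {efficiency:.0%} -> biallelic {fraction:.0%}")
```

Under these assumptions, a per-allele disruption rate of 50% leaves only a quarter of the cells with both alleles disrupted, which illustrates why efficiency matters so much when the modified cells have no selective advantage.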

Gene editing becomes even more ambitious when aiming to replace 
a target sequence with an exogenous version of choice by exploiting the 
homology-directed repair (HDR) pathway of DNA DSBs. To achieve 
this, artificial nucleases and an exogenous DNA template bearing homology
to the target site and comprising the new sequence must be delivered
to the cell. The approach has great potential for use in ex vivo gene
therapy because the targeted integration of an expression cassette into a 
preselected genomic ‘safe harbour’ or the in situ reconstitution of a
mutant gene would ensure robust and predictable expression — closely 
recapitulating the endogenous expression control in the latter — without 
the risk of insertional mutagenesis. Several hurdles must be overcome
before these strategies can be fully exploited. First, the efficiency
of HDR-mediated genome editing remains low in most primary cell types
of relevance to gene therapy, such as HSCs. In addition, it is challenging
to achieve the safe and feasible clinical translation of cell-therapy products 
when having to rely on selection and extensive ex vivo amplification of a 
few edited cell clones. The cellular response to DNA DSBs varies accord- 
ing to cell type and cell cycle and growth statuses, and ranges from repair 
by the different pathways to differentiation or apoptosis. Overall, how
the cell chooses between NHEJ and HDR is poorly understood. Alterna- 
tive mechanisms for HDR that might function outside the S/G2-phases 
of the cell cycle are also emerging. 

Multiplexed, targeted genome editing is now easily achievable through 
the application of clustered regularly interspaced short palindromic repeat 
(CRISPR)/Cas9 RNA-based nucleases. These nucleases can be rapidly and 
easily adapted to seek out any DNA target site by designing an RNA guide 
instead of generating a protein-based sequence recognition motif for each 
target. Consequently, multiple applications have been found for targeted
genome editing in experimental and preclinical models. Translating these 
applications to the clinic will, however, require thorough assessment of 
the off-target activity of the selected nuclease and optimization of the
therapy. Gene addition by HDR or conventional vectors can also be com- 
bined with the targeted disruption of another cellular gene to augment 
effector function, such as by relieving inhibitory checkpoints when editing 
T-cell specificity.

Gene-editing strategies are also being developed for use in direct in 
vivo applications, including liver-directed gene therapy. Studies in mice 
have reported the reconstitution of FIX expression in haemophilia B mice 
by AAV-mediated delivery of ZFNs and a repair template, the targeted
disruption of the cholesterol regulatory gene Pcsk9 by AAV-mediated 
BOX 3


Pricing gene therapy 


The first gene therapy to be commercially approved in the Western 
world was alipogene tiparvovec (also known as Glybera). This 
muscle-directed adeno-associated virus 1-based gene therapy was 
granted marketing authorization in the European Union in 2012 
for the treatment of a rare form of familial dyslipidaemia. The
market price was recently set at €1 million (US$1.1 million) per
treatment. If forthcoming gene therapies are also sold at such a
high price, they will challenge the standard reimbursement policies 
of governments and insurance companies. This high price reflects 
the cost of preclinical development, manufacturing and distribution 
of the new medicine, especially for ex vivo gene therapies, which 
are highly personalized and require individualized manufacturing. 
However, gene therapies have the potential to deliver a substantial, 
long-lasting benefit to the patient on a single administration, which 
may offset the cost of the standard treatment of the condition
and its complications. Nonetheless, a single upfront payment
model for gene therapy may not be sustainable. Approaches that 
spread the payment over several years should be considered, 
which could be linked to the successful outcome of the therapy. 
For example, a pay-for-performance strategy has been proposed
that makes the payment of instalments dependent on improved 
patient health, as determined by objective biomarkers. The risks 
are then shared between the health-care provider and insurer and 
the cost of treatment is more closely commensurate to the actual 
benefits delivered to the patient. In addition, the sustainability of 
gene therapies could be improved by adapting regulatory and 
manufacturing requirements to accommodate the unique features 
of these medicines and by facilitating their accessibility and 
distribution without detriment to their safety. A flexible platform- 
based approval and registration strategy should be considered, 
especially when developing gene and cell therapies that must
be adapted to each individual, as is the case for the transfer of
T-cell antigen receptor genes that target tumour neoantigens into
autologous T cells of people with cancer.
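
The pay-for-performance idea described in this box can be sketched in a few lines. This is a hypothetical scheme for illustration only, not the proposal cited above: it assumes equal annual instalments, no interest, and that a missed biomarker milestone forfeits only that year's instalment. The function name and the five-year schedule are assumptions; the €1 million figure is the price quoted in the box.

```python
from typing import List


def performance_payments(total_price: int, milestones_met: List[bool]) -> List[int]:
    """Split a price into equal annual instalments, each paid only if that
    year's biomarker milestone was met.

    Hypothetical scheme for illustration: equal instalments, no interest,
    and a missed milestone forfeits only that year's instalment.
    """
    if not milestones_met:
        raise ValueError("at least one instalment period is required")
    instalment = total_price // len(milestones_met)
    return [instalment if met else 0 for met in milestones_met]


if __name__ == "__main__":
    # Five annual checks; the biomarker target is missed in year 3.
    payments = performance_payments(1_000_000, [True, True, False, True, True])
    print(payments, sum(payments))
```

In this sketch the payer's total outlay falls in proportion to the number of missed milestones, which is one simple way of tying cost to the benefit actually delivered.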


delivery of CRISPR/Cas9 (ref. 91), and the repair of the Fah mutation 
in tyrosinaemia (an inability to break down the amino acid tyrosine) by 
hydrodynamic plasmid delivery of CRISPR/Cas9 and a template. Major
hurdles to the further development of in vivo gene editing include the safe 
and clinically suitable delivery of the editing machinery, which should act 
transiently without inducing cellular toxicity and immunogenicity. All 
artificial nucleases in current use employ one or more domains derived 
from prokaryotes — in some cases, common bacterial pathogens or 
saprophytes. Therefore, when delivered in vivo, there is a risk that such 
nucleases will generate or encounter a pre-existing cell-mediated immunity,
especially if they are expressed over the long term. More recently, an
endonuclease-independent gene-targeting strategy was demonstrated 
in mice. A transgene was targeted to the albumin locus of hepatocytes by 
administering a promoterless AAV vector with homology to the highly 
transcribed albumin gene. A small proportion of the vector integrated at
the albumin locus and became expressed from its endogenous promoter, 
which removed the requirement for both endonucleases and the transfer 
of a promoter within the vector. The efficiency and underlying mecha- 
nism of this and the other types of in vivo gene editing discussed in this 
Review are still to be fully determined. 


Retinal gene therapy 

The remarkable benefits that gene replacement can provide to patients 
with severe degenerative diseases were first highlighted by retinal gene 
therapy. Subretinal administration of AAV serotype 2 (AAV2)-mediated
gene therapy in patients with type 2 Leber congenital amaurosis (LCA),
an inherited retinal dystrophy that causes loss of vision at an early age, led 
to improved visual acuity in several young patients in three independent 
trials. In two of these trials, the patients lost the benefit after a follow-
up period of 2–3 years. However, sustained benefit was reported after a
similar follow-up period in the third trial, which has now progressed to
phase III testing. The reasons for the different outcomes of these trials, all
of which used an AAV2-derived capsid, remain unclear. The progressive
degenerative nature of type 2 LCA poses a challenge for delivering
extended therapeutic benefits because the small number of photorecep- 
tors that are rescued by the therapy can eventually be overcome by non- 
cell-autonomous changes in the tissue. As discussed for liver-directed 
therapy, there might also be subtle differences in vector design and manu- 
facturing that affect the extent of in vivo gene transfer, the inflammatory 
response at the site of delivery and the level of transgene reconstitution in 
transduced cells. The availability of AAV vectors with higher potencies, 
which would allow safer dose escalation and enhanced transduction, and 
more stringent tropism for the relevant targets, should help to overcome 
these limitations. 


Other relevant developments 

There are several other advances in the gene-therapy field that could 
not be discussed in detail in this Review. These include applications in 
neurodegenerative diseases that have reached the clinical-testing stage. 
For example, good safety but limited efficacy has been demonstrated for 
the delivery of transgenes to the brain by AAV or lentiviral vectors.
Increasing the administered vector dose and optimizing the vehicle, cargo 
and study design are likely to lead to further advances. 

Oncolytic viruses, which infect and kill cancer cells, have been in clini- 
cal use for some time. Although they can deliver robust and clinically 
relevant anticancer activity, these viruses are still being used as part of 
combination therapies. Oncolytic therapies exploit viral replication and
the induction of an immune response against infected cells — conditions 
that are normally offset in gene-therapy strategies. Intriguingly, their effi- 
cacy is likely to be augmented by adding a transgene cargo that improves 
the induction of anticancer immunity.

Adenoviral vectors of simian origin are also being assessed for their 
ability to induce humoral and cellular immunity through vaccination: 
encouraging results have already been seen in emerging or widespread 
infectious diseases that have long resisted conventional attempts.
In preclinical models, the gene-based delivery of antibody therapy or
prophylaxis is being explored to establish a stable and robust in vivo
source of large quantities of antibodies with optimal specificity to treat or
prevent infection.


Future outlook 
Gene therapy could be poised to become an important new approach 
for the third millennium because its reach extends well beyond that of 
conventional drugs. Gene therapy enables the targeted delivery of infor- 
mation-rich gene-based cassettes that facilitate the stable, sustained and 
regulated expression of biological agents. Furthermore, when combined 
with cell therapy, it turns cells into smart vehicles for targeted gene deliv- 
ery. As exemplified in the studies discussed in this Review, gene therapy 
directs powerful biological processes towards the goals of disease correc- 
tion, tissue repair and regeneration. For instance, the stability, fidelity and 
amplification of the delivered therapeutic can be guaranteed by transfer- 
ring information by genetic mechanisms. The homing and trafficking 
mechanisms of cells in the human body can be used to target gene-based 
therapeutics to specific tissues and disease sites. Gene therapy also makes 
use of the regenerative potential of stem cells and transplantation as well 
as the biological weapon of immunity, which is exploited for the specific 
elimination of transformed or infected cells. By taking advantage of these 
inbuilt biological capabilities, gene therapy has the potential to address 
the substantial unmet medical needs of both rare and common severe 
diseases, which will benefit both patients and — more broadly — society. 
Major challenges must still be addressed before this promise can be 
realized. For example, the efficacy and safety of gene-transfer vectors
should be improved by further engineering their design and composition,
which could include combining the biological features of different viruses
with synthetic molecules. These advances will enable vectors to target
tissues and cell types precisely, overcome cellular restrictions on
gene transduction and bypass sensors of exogenous nucleic acids. They
will also help vectors to avoid activating the innate and adaptive immune 
system. Overall, the changes will also ensure that transgene expression is 
reproducible, robust, occurs over an extended period and closely mimics 
the endogenous pattern of expression (when gene replacement is per- 
formed). Improvements in vector manufacturing and characterization 
will allow the standardization and comparative assessment of vector per- 
formance between trials. The rate and specificity of in situ gene correc- 
tion and editing, the integration of vectors at safe genomic harbours, and 
allele-specific silencing by artificial nucleases and epigenetic modifiers 
represent further opportunities for improving gene-therapy strategies. 
Deeper understanding of disease pathogenesis in inherited, multigenic 
or acquired conditions will enable the development of new gene-based 
treatment strategies. Because gene transfers employ ‘live’ biological drugs
of unprecedented complexity that have the potential to induce extended 
effects on patients and their germ lines, long-term surveillance and pre- 
cautionary measures must be taken while their use is being pioneered. 
Moreover, current — limited — understanding of the regulation of stem 
cells, tissue regeneration and immune-response checkpoints constrains 
the capacity for intervention and raises concerns about the untoward 
effects of manipulation. 

From a clinical standpoint, the bedside delivery of gene and cell thera- 
pies calls for multidisciplinary expertise and, in some cases, advanced cell 
processing at the clinical-treatment site. Biological readouts must also be 
developed to monitor the safety and efficacy of therapies. As the first gene 
therapies progress from registration to marketing, both the pharmaceutical 
sector and regulatory agencies are being engaged to help define appropri- 
ate quality standards for manufacturing and release and to build suitable 
pipelines for supplying such highly personalized therapies. From a societal 
standpoint, the complexity and cost of manufacturing and supplying ‘live’
biological drugs in conventional health-care systems will challenge the 
sustainability of these therapies and require creative cost-reimbursement 
policies that enable all patients to benefit from them (Box 3). 

Finally, from an ethics standpoint, it is important to consider whether 
medicine should surrender to the rule of technology or commit to a more 
responsible steering of the course of progress. For instance, the avenues 
that are being opened to intervention by emerging technologies could 
undermine our self-perception and self-determination as we end up view- 
ing ourselves as the evolutionary product of DNA that has become self- 
conscious and can edit itself to shape its progeny as and when it desires. 
The call for a moratorium on applying genome editing to human germline 
cells highlights forthcoming ethical dilemmas. Science might turn
again to the ancient roots of Western culture to learn from its wisdom, 
such as the “Know thyself” inscription on the Temple of Apollo at Del- 
phi in ancient Greece — a warning to recognize our limits. Modern phi- 
losophy has trained us to exercise criticism when assessing the truth and 
certainty of knowledge, which is limited a priori and enables learning 
relationships and making predictions but not uncovering the nature of 
things. Those predictions allow us to develop strategies that can alleviate 
human suffering from disease, thereby providing our endeavours with a 
translational framework that can guide our choices and justify them from 
an ethics standpoint.
Patient-centric trials for therapeutic
development in precision oncology


Andrew V. Biankin, Steven Piantadosi & Simon J. Hollingsworth


An enhanced understanding of the molecular pathology of disease gained from genomic studies is facilitating the devel- 
opment of treatments that target discrete molecular subclasses of tumours. Considerable associated challenges include 
how to advance and implement targeted drug-development strategies. Precision medicine centres on delivering the 
most appropriate therapy to a patient on the basis of clinical and molecular features of their disease. The development of 
therapeutic agents that target molecular mechanisms is driving innovation in clinical-trial strategies. Although progress 
has been made, modifications to existing core paradigms in oncology drug development will be required to realize fully
the promise of precision medicine.


Insights into the molecular pathology of disease are creating
opportunities for the development of therapies with durable clinical
benefit while challenging the existing model of therapeutic
development and clinical care. Large international consortia,
such as the International Cancer Genome Consortium, are mapping
the genomes of thousands of cancers to identify opportunities
for prevention, early detection and treatment. Although genomics
is leading the way, high-throughput proteomics and metabolomics
are following closely behind. Such methodological advances have
ushered in a new era of therapeutics that target specific molecular
processes. Although there have been some dramatic successes,
the overall strategy remains in its infancy. The central premise of
precision medicine is that matching a drug and its mechanism of
action using a marker to select patients (a process often referred
to as matching the right drug to the right patient) can offer greater
potential for durable clinical benefits.

Initially, these targeted therapeutic agents followed the same clinical
development pathway as cytotoxic chemotherapy, that is, based on
tumour location and histopathology, driven by the notion that molecular
aberrations were tumour specific. Efforts to advance this approach
stalled because of the lack of efficacy data in patients with different
cancer types that shared a molecular aberration, coupled with early
observations that the functional importance of some aberrations varied
between tumour types. However, the emergence of programmes
that identified molecular targets and matched treatments to molecular
subtypes (or segments) led to several reports that directly
linked this approach to improvements in clinical outcome, irrespective
of the organ in which the tumour originated. Although many were based
on retrospective analyses of tumour samples, and not all reports were
equally convincing, the utility of broad molecular profiling to guide
patients towards specific targeted therapies was established. Researchers
moved quickly to implement this new paradigm. To meet emerging
requirements, and enticed by the promise of clinical benefit, clinicians
recognized that the established pathways of therapeutic development
would need to change. However, the practical implications of implementing
these changes in the clinic were unclear.

The drivers of precision medicine have been established and discussed
elsewhere. However, fresh challenges for therapeutic
development are many and substantial. Fundamentally, a candidate
treatment requires a strong platform of evidence to support its clini- 
cal testing and must be coupled with robust methods to identify 
appropriate patients (using molecular assays). Our appreciation
of the molecular diversity of cancer and the ever-increasing number
of molecular subtypes creates considerable complexity for the
development of targeted drugs. When tested in trials of unselected
participants, most targeted therapies reveal efficacy only if both the
incidence of a responsive subpopulation and the effect size within that
group are sufficiently high. Increasing the size of clinical trials to
overcome this lack of enrichment yields minimal overall benefits at a cost
that makes them unattractive and unaffordable to the community. 
Designing trials that feasibly evaluate both patient selection and drug 
efficacy is crucial, and it is essential to define the correct metrics to 
assess efficacy, particularly when the study needs to be small. 
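
The dilution argument above can be made concrete with a rough calculation. The sketch below is illustrative only and is not a method taken from this Review: it assumes a continuous outcome compared between two equal arms, a normal approximation, no treatment effect at all in biomarker-negative patients, and hypothetical values for the effect size, variability, significance level and power.

```python
from math import ceil
from statistics import NormalDist


def patients_per_arm(effect, sd, alpha=0.05, power=0.8):
    """Approximate patients per arm for a two-arm comparison of means.

    Uses the standard normal-approximation formula
    n = 2 * (z_{1-alpha/2} + z_{power})^2 * (sd / effect)^2.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    return ceil(2 * (z_alpha + z_power) ** 2 * (sd / effect) ** 2)


def diluted_effect(effect_in_responders, prevalence):
    """Average effect in an unselected population.

    Assumption (hypothetical): only biomarker-positive patients benefit,
    so the average effect shrinks in proportion to marker prevalence.
    """
    return effect_in_responders * prevalence


if __name__ == "__main__":
    effect, sd = 1.0, 1.0  # hypothetical: effect equal to between-patient SD
    for prevalence in (1.0, 0.5, 0.1, 0.02):
        n = patients_per_arm(diluted_effect(effect, prevalence), sd)
        print(f"prevalence {prevalence:.0%}: about {n:,} patients per arm")
```

Under these assumptions the required trial size grows with the inverse square of the marker prevalence, so a marker present in 2% of patients inflates the size of an unselected trial by a factor of about 2,500 relative to a fully enriched one.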


Principles and evolution of clinical trials 

Clinical trials are most useful when they assess a potential therapeutic 
effect that is about the same size or slightly smaller than the effect of 
the natural variation that exists between individuals. When the vari- 
ation between individuals enrolled in a trial influences a treatment 
only randomly, it can be ignored in a biological sense and controlled 
by replication. These dual strategies for controlling for variation 
embody the empirical and theoretical aspects of trials. For much 
of the history of clinical trials, the treatments under investigation 
were assumed to apply to anyone with the relevant clinically defined 
condition. Essentially, our understanding of biology suggested that 
treatments worked through common mechanisms that were set apart 
from random variation. This assumption was substantially correct 
for approaches such as cytotoxic chemotherapy that target generic 
disease mechanisms, and it enabled considerable progress to be made 
in treating cancer. Towards the end of the twentieth century, con- 
cerns arose regarding the potential inhomogeneity of therapeutic 
effects because of socio-political characteristics such as race or sex. 
Many clinical trials were designed and analysed to examine such 
differences. Although these analyses were motivated by politics and
social justice rather than scientific fact, only minimal changes were
actually made to the design of such trials, which was probably
appropriate given the weak biological basis for differences that can
be attributed to these superficial characteristics.

¹Wolfson Wohl Cancer Research Centre, Institute of Cancer Sciences, University of Glasgow, Glasgow, Scotland G61 1BD, UK. ²The Kinghorn Cancer Centre, Cancer Division, Garvan Institute of
Medical Research, Sydney, New South Wales 2010, Australia. ³Department of Surgery, Bankstown Hospital, Sydney, New South Wales 2200, Australia. ⁴South Western Sydney Clinical School, Faculty
of Medicine, University of New South Wales, Liverpool, New South Wales 2170, Australia. ⁵Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, California
90095, USA. ⁶Innovative Medicines & Early Development Oncology, AstraZeneca, Cambridge Science Park, Cambridge CB4 0FZ, UK.

15 OCTOBER 2015 | VOL 526 | NATURE | 361

© 2015 Macmillan Publishers Limited. All rights reserved

REVIEW

[Figure 1 panels: a, Biomarker analysis within existing RCT; c, Targeted RCT. Flow-diagram labels include Population, Biomarker analysis, Treatment, Control and Outcomes.]

Figure 1 | Randomized controlled trial designs for defining and testing
precision-medicine strategies. a, Biomarker discovery is performed in a
trial that is used to address a therapeutic question but patient recruitment
and treatment allocation are not informed by the marker status. b, A
non-targeted biomarker study in which the trial is designed and powered
to address the biomarker hypothesis to ensure adequate biomarker
representation and distribution between arms. c, Biomarker-targeted
randomized controlled trial (RCT) in which the presence of the selection
marker guides patient allocation. d, RCT that compares biomarker-directed
therapy with conventional therapy, which allows the overall concept of the
biomarker approach to be tested as a whole. Adapted with permission from
ref. 26. R, randomization.

The recognition that clinical trials need to be redesigned to account
for non-random variation comes more from knowledge of the disrupted
cancer genome than of the germ line. The implications of having
multiple potential treatments and diseases where once there was just
one put enormous pressure on researchers to alter the design of
clinical trials. Investigators often approach the challenge of having
too many diseases and too few trial subjects as a result of genomic
partitioning as a clinical-trial design problem. This creates
unhealthy tension between design strategies because although
clinical-trial design must be tailored to answer specific questions that
arise from targeted therapies, many of these questions are actually
standard and can be addressed by well-established methodologies.
Consequently, the challenges of conducting clinical testing for most
precision-medicine strategies revolve around their feasibility,
efficiency and capacity to deal with multiple small-incidence subtypes
of cancer and a rapidly evolving knowledge base.

In response, drug-development pathways have evolved to accommodate
two important strategies: generating signals that indicate clearly the
safety and efficacy of useful treatments, and terminating the
development of ineffective treatments as early as possible. The four
phases of clinical trials feed into these strategies. The early
development phase (phase I) focuses on the safety aspects of a drug,
including dosage, in a small group of patients. The middle-development
phase (phase II) evaluates the safety and efficacy of a drug in a
larger group of patients, and enables a ‘go/no-go’ decision to be
made. The late development phase (phase III) constitutes
comparative testing and provides a basis for seeking approval to market the
drug. Phase IV trials are sometimes performed after market approval 
has been granted to examine the safety and efficacy of the drug in 
other patient populations, as well as any side effects and the implica- 
tions of long-term use. These studies can also extend the applica- 
tions or ‘indications’ of the drug. Through the sequential building of 
evidence, the use of a new therapeutic agent for a specific indication 
can be supported or refuted. In this model, a premium is placed on 
randomized, controlled designs. 

Biomarkers — biological characteristics that can be measured in 
the context of diagnosis and clinical intervention — are often used 
to drive the selection of participants for trials, a strategy known as 
enrichment, which is well established for high-prevalence biomark- 
ers. There are a number of methods for assessing the clinical utility 
of biomarkers (Fig. 1). For example, randomized controlled trial data 
can be analysed retrospectively (Fig. 1a). Biomarker discovery can 
also be integrated within the design of the trial to ensure that there is 
sufficient power to detect signals. Biomarker-positive patients can be 
equally distributed in each arm (known as biomarker stratification) 
to ensure statistical power (Fig. 1b), and the biomarker itself can be 
used to direct the study (Fig. 1c, d). Advances in our understanding
of the differences between the molecular pathologies of individual
cancers create challenges for conventional drug-development models,
especially as the prevalence of molecular segments decreases.
The chances of showing a significant effect in a traditional
comparative trial of unselected participants diminish if the prevalence
of a biomarker that identifies tumours most likely to respond to a
targeted therapeutic agent is low. For example, if the biomarker is
present in only 2% of the population (a typical prevalence for many, if
not most, molecular segments), a study of 50-100 patients yields only
one or two patients with the biomarker. Unfortunately, no amount of
clinical effect in such a small number of patients would be enough to
advance the drug’s therapeutic development (assuming that there is no
clinical effect in the population who test negative for the biomarker).

Figure 2 | Design principles that generate efficiencies in clinical
trials of targeted therapies. a, In umbrella studies, patients with the
same type of cancer are screened for a series of hypothesized predictive
biomarkers. They are then allocated to appropriate therapies within the
trial architecture. (The biomarker status for each tumour in the study
is determined by tumour molecular analysis.) b, Basket studies recruit
patients on the basis of their molecular characteristics irrespective of
the organ in which their tumour originated. c, The relative incidence of
molecular subtypes can help to guide decisions as to whether an umbrella
or basket clinical-trial strategy is most appropriate. Molecular subtypes
can be classified by their organ of origin (left) or on the basis of their
molecular characteristics or ‘biotype’ (right). Stratification is helpful when
the incidence of a specific molecular class is low across different organs
of origin and tends to be tested with a basket approach. Adapted with
permission from ref. 75. CML, chronic myeloid leukaemia; DDR, DNA
damage response; ER, oestrogen receptor; GIST, gastrointestinal stromal
tumour; NB, neuroblastoma; NHL, non-Hodgkin lymphoma; NSCLC,
non-small-cell lung cancer; Pa, pancreatic cancer; PDAC, pancreatic ductal
adenocarcinoma (pancreatic cancer); RCC, renal cell carcinoma; SCCHN,
squamous cell carcinoma of the head and neck; WT, wild type.

Evaluating a targeted drug or treatment in the early phases of
development will now more frequently require a trial with a selected
patient population to minimize the inclusion of individuals who are
unlikely to respond for mechanistic reasons. Inevitably, this yields
smaller trials and fewer data on which to base decisions about
trial-phase transitions. It also creates challenges when developing
appropriate comparator populations in early studies. These approaches
raise a number of interesting questions. For instance, how many
patients must be evaluated to truly understand the safety and efficacy
of a drug or treatment? Should later studies remain solely focused on
the selected patient population and include just one arm? What are
the drug effects in biomarker-negative patients? Owing to errors in
diagnosis during routine clinical practice, such patient populations
will exist even if they are not selected for investigation during the
drug-development process. How can we build the body of evidence
needed to support the approved use of a drug or therapeutic agent in
a particular indication? As a consequence, challenges are introduced
throughout the entire drug-development pathway. These can be
basic, such as the practicalities of finding enough patients who have 
low-incidence markers to investigate, and understanding the utility 
of the markers used for selection. They can also affect central aspects 
of the drug-development pathway, such as how to generate the data 
packages needed for regulatory submissions and market approval. 
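
The prevalence example given earlier in this section (a marker present in 2% of patients, in an unselected study of 50-100 patients) can be checked with a few lines of arithmetic. The sketch below is illustrative: it assumes that patients are sampled independently, so that the number of marker-positive patients follows a binomial distribution, and the function names are hypothetical.

```python
from math import comb


def expected_positives(n_patients, prevalence):
    """Expected number of biomarker-positive patients in an unselected study."""
    return n_patients * prevalence


def prob_at_least(k, n_patients, prevalence):
    """Binomial probability of enrolling at least k marker-positive patients.

    Assumes independent sampling of patients (an assumption of this sketch).
    """
    p_fewer = sum(
        comb(n_patients, i) * prevalence**i * (1 - prevalence) ** (n_patients - i)
        for i in range(k)
    )
    return 1 - p_fewer


if __name__ == "__main__":
    for n in (50, 100):
        print(
            f"{n} patients: expected positives = {expected_positives(n, 0.02):.1f}, "
            f"P(at least 5 positives) = {prob_at_least(5, n, 0.02):.4f}"
        )
```

The expected counts reproduce the one or two patients quoted in the text; the second quantity shows how unlikely it is that such a study would contain even a handful of marker-positive patients.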


Patient-centric drug development 
The challenges discussed in this Review have resulted in new clinical- 
trial designs (Fig. 2). An umbrella study (Fig. 2a) typically investigates 
a single tumour type selected according to the biomarkers relevant to 
one or more of the candidate drugs, and patients are directed towards 
different arms of the study — and hence towards different therapeu- 
tics — according to the molecular characteristics of their tumour. A 
basket study (Fig. 2b) also selects tumours according to their molecu- 
lar characteristics and biomarkers, but is conducted irrespective of 
tumour type and often focuses on one (or a few) specific markers. The 
approach that is chosen will be based on various aspects, including 
the prevalence of a molecular subtype within a cancer type compared 
with its prevalence across different cancer types (Fig. 2c). Consid- 
eration will also be given to whether initiatives led by cooperative 
groups focussed on specific cancers exist, as well as the practicality 
of implementing these studies, such as the ability to acquire samples 
of tumour for analysis. 
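
The difference between the two designs can be sketched as a simple allocation rule. The example below is a minimal illustration and not the allocation algorithm of any trial discussed in this Review: the marker names, arm names, the 'first matching marker wins' rule and the fallback arm for marker-negative patients are all hypothetical assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Patient:
    patient_id: str
    tumour_type: str
    biomarkers: frozenset


def umbrella_arm(patient, tumour_type, arm_by_marker,
                 default_arm="standard of care") -> Optional[str]:
    """Umbrella design: one tumour type, several marker-defined arms.

    Returns None if the patient has a different tumour type. The first
    matching marker wins, which is an arbitrary choice for illustration.
    """
    if patient.tumour_type != tumour_type:
        return None
    for marker, arm in arm_by_marker.items():
        if marker in patient.biomarkers:
            return arm
    return default_arm


def basket_eligible(patient, marker) -> bool:
    """Basket design: any tumour type, selected on a single marker."""
    return marker in patient.biomarkers


if __name__ == "__main__":
    arms = {"marker_A": "drug A arm", "marker_B": "drug B arm"}  # hypothetical
    patients = [
        Patient("p1", "NSCLC", frozenset({"marker_B"})),
        Patient("p2", "breast", frozenset({"marker_B"})),
        Patient("p3", "NSCLC", frozenset()),
    ]
    for p in patients:
        print(p.patient_id,
              "| umbrella (NSCLC):", umbrella_arm(p, "NSCLC", arms),
              "| basket (marker_B):", basket_eligible(p, "marker_B"))
```

In this sketch the umbrella rule filters on tumour type first and then on markers, whereas the basket rule ignores tumour type entirely, which mirrors the distinction drawn in Fig. 2a, b.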

A solution to some of these challenges in targeted-drug
development is the use of a master protocol, some of which have
been established for efficiency in certain settings (Fig. 3 and Table 1).
Rather than using serial, single diagnostic tests to select participants
for different trials, a single, multiplex diagnostic assay is often used
to assign participants to different candidate drugs (or arms of a
trial) within the same trial, or a network of trials. This is sometimes
referred to as a ‘tent’ protocol, in which multiple trials can be accessed
through various mechanisms. Such studies offer more options for
patients and can also make patient screening and recruitment more
efficient.

Increasingly, adaptive design features are being incorporated.
These differ from conventional designs by using accumulated results
to modify the course or structure of a trial. The ability to make an
early assessment of the clinical benefit or safety of a drug (and to
modify the trial in response) is a nimble approach and offers a
number of advantages. For instance, the trial can be stopped early
or extended depending on the emerging results, or arms or doses
can be dropped if no benefit is seen. This approach makes it easier
to identify populations of patients who are responding to the drug
being investigated, or to identify fruitful combinations of biomarkers
and drugs or other therapeutics. It also allows the randomization
proportions of the trial population or the rates at which data are
accrued to be changed. Finally, it permits the inclusion of multiple
stages of drug development within a single trial. Staged approaches
such as these can markedly enable the drug-development process
(Fig. 4). Examples of clinical trials that use these approaches include
the Biomarker-integrated Approaches of Targeted Therapy for Lung
Cancer Elimination (BATTLE) and the Investigation of Serial Studies
to Predict Your Therapeutic Response with Imaging and Molecular
Analysis (I-SPY) series of trials for lung and breast cancer.

Figure 3 | Master protocols for therapeutic development: a framework for the
clinical testing of precision-oncology strategies, or ‘finding the trial for the
patient’. A guiding principle within the framework is that all patients who are
eligible for treatment should receive a choice of therapies. These therapies range
from biomarker-directed or unselected new therapeutic strategies (either as part
of the trial design or through external trials) to standard-of-care treatment in
which patients will still be tracked to inform biomarker discovery opportunities
for existing approved therapeutics. The framework can be enacted by a single
body or, more pragmatically, through a composite or network of organizations
and activities with a co-ordinated management and governance structure.
Stage 1 of the framework includes patient recruitment and molecular testing.
Participants are either screened before entering the trial or directed to molecular
testing to be done within the trial structure itself or by external providers, if
more appropriate. In stage 2, patients and clinicians are presented with a series of
attractive clinical-trial options to choose from. This stage also incorporates an
additional consent process.

Targeted therapeutic development is evolving rapidly, and there
has been a notable expansion of precision-medicine programmes in
recent years (Table 1). Combining a detailed understanding of the
molecular pathology of tumours with modern drugs and associated
diagnostic technologies for selecting patients has already translated
into tangible improvements in survival rates for patients with certain
cancer types, particularly those with non-small cell lung cancer
(NSCLC). In addition, significant durable responses to immune
modulatory therapies have been discovered in about 15% of patients.
These therapeutic agents target specific molecular mechanisms that
are currently the focus of intense investigation. Patient selection is
also likely to play an important part in the development of these 
agents, with biomarker hypotheses being actively developed for the 
identification of trial participants. Data are emerging from early
programmes such as SHIVA, which broadly evaluated targeted
therapies without taking into account the histology of the tumour in
end-stage patients for whom standard therapy had failed. Although
no difference was identified, it is not possible to draw broad
conclusions from this finding, exemplifying the challenges ahead.
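
As a rough illustration of the adaptive-design features described earlier in this section (dropping arms when no benefit is seen and changing randomization proportions as results accumulate), the sketch below applies one simple rule at an interim analysis. It is not the procedure used in BATTLE, I-SPY or any other trial named here: the uniform Beta(1, 1) prior, the futility threshold, the minimum number of patients and the arm names are all hypothetical assumptions.

```python
def posterior_mean(responders, treated, prior_a=1.0, prior_b=1.0):
    """Posterior mean response rate under a Beta(prior_a, prior_b) prior."""
    return (responders + prior_a) / (treated + prior_a + prior_b)


def update_allocation(results, min_patients=10, futility=0.10):
    """Return new randomization probabilities for each arm.

    results maps arm name -> (responders, patients treated so far).
    An arm is dropped (probability 0) once it has at least min_patients
    and its posterior mean response rate is below the futility threshold.
    Remaining arms are weighted in proportion to their posterior means.
    """
    weights = {}
    for arm, (responders, treated) in results.items():
        mean = posterior_mean(responders, treated)
        dropped = treated >= min_patients and mean < futility
        weights[arm] = 0.0 if dropped else mean
    total = sum(weights.values())
    if total == 0:
        return {arm: 0.0 for arm in weights}
    return {arm: w / total for arm, w in weights.items()}


if __name__ == "__main__":
    interim = {"arm 1": (6, 12), "arm 2": (0, 12), "arm 3": (1, 4)}  # hypothetical
    for arm, prob in update_allocation(interim).items():
        print(f"{arm}: randomization probability {prob:.2f}")
```

Real adaptive trials specify such rules in advance and evaluate their operating characteristics by simulation; the point here is only to show how accumulated results can feed back into the structure of the trial.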

The oncology landscape is accumulating a growing number of 
patient and tumour groups that can be identified by (increasingly
complex) diagnostic assays, which enables them to be coupled to
molecularly targeted drugs. Up-to-date approvals can be found on
the websites of the US Food and Drug Administration (FDA) and
the European Medicines Agency (EMA). Although most approved 
therapies have a linear relationship with a single biomarker, emerging 
data suggest that combinations of biomarkers might better inform 
therapeutic responsiveness, and will continue to challenge biomarker 
development. Similarly, multiple biomarkers could indicate sensitiv- 
ity to a single therapeutic agent, and conversely a single biomarker 
might define patients that would benefit from several therapeutic 
options. Such overlaps are inevitable and it is important to define 
appropriate measures on how to respond to them during the drug- 
development process. The emerging complexity poses substantial 
challenges for current regulatory processes. For example, how should 
researchers assess therapies that do not take the cancer’s organ of
origin into account, particularly when its prevalence is low in a
particular organ? How should therapies be assessed at different stages of the
disease, especially in cases where the patient has undergone several 
prior treatments? A solution might be to apply a broader approach, 
such as defining the level of reimbursement for a particular disease 
stage and line of treatment, with decisions on choice of therapy made 
between clinicians and their patients. 


The challenges of early drug development 

Clinical testing in the early stage of drug development poorly
predicts efficacy in later stages of development. Bias in small early
trials can raise expectations, only to cause disappointment when they
are expanded to include larger, less-selected and unbiased
populations. Current tools that provide an improved understanding of the
molecular pathology of tumours can be used to inform smaller trials
as well as to define sources of bias at the molecular level to inform
early and ongoing therapeutic development. An emerging approach
is the testing of small numbers of patients underpinned by a deep
understanding of both the molecular composition of tumours and the
mechanism of action of the therapeutic agent. Knowledge acquired
through clinical testing can then inform ongoing preclinical
strategies, which in turn refine the clinical-testing approach, a process
known as forward-and-backward translation (Fig. 5).

Table 1 | Precision-medicine studies

Precision-medicine clinical trials

Study | Tumour | Phase/design | Location | Arms | Patients† | Clinical trial ID | References
Bisgrove | All | Phase II, non-randomized | United States | N/A | 84 | NCT00530192 | 19
IMPACT | All | Phase I | United States | N/A | 1,144 | NCT00851032 | 20
MOSCATO 01 | All | Phase I | France | N/A | 420 | NCT01566019 | 21
Lung-MAP | Squamous lung | Phase II/III, randomized | United States | 5 | 10,000 | NCT02154490 | 49
BATTLE | NSCLC | Umbrella, route to four phase II randomized | United States | 4 | 300 | NCT00409968 (umbrella), NCT00411671, NCT00411632, NCT00410059, NCT00410189 | 31, 66, 67
BATTLE-2 | NSCLC | Phase II randomized | United States | 4 | 450 | NCT01248247 | N/A
BATTLE-FL | NSCLC | Phase II randomized | United States | 4 | 225 | NCT01263782 | N/A
I-SPY 2 | Breast cancer | Phase II randomized | United States | 8 | 800 | NCT01042379 | 68, 69
NCI-MPACT | All | Phase II stratified, non-randomized | United States | 6 | 700 | NCT01827384 | 70
NCI-MATCH | Solid | Phase II stratified, non-randomized | United States | 20 | 3,000 | Umbrella, route to phase II‡ | 48
V-BASKET | All | Phase II stratified, non-randomized | Global | | 160 | NCT01524978 | 71
CREATE | Selected | Phase II stratified, non-randomized | European Union | 6 | 582 | NCT01524926 | N/A
WINTHER | All | Stratified, non-randomized | European Union | 2 | 200 | NCT01856296 | 72
SHIVA | All | Phase II stratified, controlled | France | 10 | 1,000 | NCT01771458 | 38
MOST | All | Phase II stratified, randomized | France | | 560 | NCT02029001 | N/A
SAFIR 02 Lung | NSCLC | Phase II stratified, randomized | France | 8 | 650 | NCT02117167 | 73
SAFIR 02 Breast | Breast cancer | Phase II stratified, randomized | France | 18 | 460 | NCT02299999 | N/A
Lung MATRIX | NSCLC | Phase II stratified, non-randomized | United Kingdom | 21§ | 2,000∥ | EudraCT 2014-000814-73 | 65
FOCUS 4 | Colorectal cancer | Phase II/III randomized | United Kingdom | 4 | 643 | EudraCT 2012-005111-12 | 74
IMPaCT | Pancreatic cancer | Phase II stratified, randomized | Australia | 4 | 90 | ACTRN 12612000777897 | 47

Screening programmes that feed into precision-medicine trials

Study | Tumour | Phase/design | Location | Diagnostics | Patients† | Clinical trial ID | References
I-SPY | Breast cancer | Phase II, diagnostic study | United States | Genomic, imaging | 221 | NCT00043017 | 32-35
NCI-MATCH | Solid | Screening, route to phase II | United States | NGS¶ | 3,000 | N/A | 48
VIKTORY | Gastric cancer | Screening, route to phase II | Asia | NGS, other# | 600 | NCT02299648 | N/A
LC-SCRUM | NSCLC | Screening, route to phase II/III | Asia | As needed** | Open†† | N/A | 53
AURORA | Breast cancer | Screening, route to phase I/II/III | European Union | NGS, other‡‡ | 1,300 | NCT02102165 | 52
SPECTAColor | Colorectal cancer | Screening, route to phase I/II/III | European Union | NGS | 2,600 | NCT01723969 | 50
SPECTALung | Lung | Screening, route to phase I/II/III | European Union | NGS | 500§§ | NCT02214134 | 51
MOSCATO | All | Screening, route to phase I/ | France | CGH array, sequencing | 1,050 | NCT01566019 | 21
SAFIR 01 | Breast cancer | Screening, route to phase I/ | France | CGH, sequencing, gene expression array | 423 | NCT01414933 | 73
CRUK SMP 1 | Selected | Screening, feasibility | United Kingdom | Bespoke panel | 9,000 | N/A | 36

BATTLE-FL, Front-Line Biomarker-Integrated Treatment Study in Non Small Cell Lung Cancer; CGH, comparative genomic hybridization; FISH, fluorescence in situ hybridization; IHC, immunohistochemistry;
IMPaCT, Individualised Molecular Pancreatic Cancer Therapy; IMPACT, Initiative for Molecular Profiling in Advanced Cancer Therapy; MOSCATO, Molecular Screening for Cancer Treatment Optimization; MOST,
Adapting Treatment to the Tumor Molecular Alterations for Patients with Advanced Solid Tumors: My Own Specific Treatment; N/A, not applicable; NCI-MPACT, National Cancer Institute-Molecular Profiling-Based
Assignment of Cancer Therapy for Patients with Advanced Solid Tumors; NGS, next-generation sequencing; VIKTORY, Targeted Agent Evaluation in Gastric Cancer Basket Korea Study.

†Estimated number of patients to be recruited, or the final number recruited where the study has been completed. ‡The NCI-MATCH programme is a screening programme used to direct patients to
single-arm, phase II, signal-seeking studies. §The number of arms will vary because the study progresses as each arm has been designed around a biomarker (for patient selection) and (candidate) drug pair.
∥Once fully operational, the study will screen 2,000 patients per year. ¶FISH and IHC assays will be used as required. #‘Other’ refers to a selection of bespoke and exploratory diagnostics.
**Bespoke diagnostics are deployed as needed to select patients for the individual clinical studies that feed from the screening programme. ††‘Open’ describes an open and rolling patient-recruitment
programme. ‡‡‘Other’ refers to RNA sequencing. §§500 patients in year 1 then 500-1,000 patients thereafter.

Figure 4 | Adaptive study designs. A within-study analysis or the continual
assessment of data can be used to change the course of a clinical trial. First,
the biomarker status for each tumour in the study is determined by tumour
molecular analysis. After each tumour is allocated to a suitable sub-study,
further analysis is conducted. Consequently, the sub-study 2 trial arm can
be stopped owing to a lack of evidence to support the clinical benefit of drug
2, and the sub-study 3 trial arm can be extended to include more patients.
Meanwhile, the patient population of sub-study 4 can be redefined into two
sub-studies, according to the results of responder/non-responder analysis.

Inherent to this approach is a desire to define more effective thera- 
pies and to set the bar higher for furthering the progression of a 
therapeutic agent down the drug-development pathway. A shift is 
needed away from the current high-investment drug-development 
approach that is dominated by late-phase trials that predominantly
fail at great expense, towards an approach in which failures are early 
and cheap. This will allow a greater number of potential therapies to 
be assessed while constraining costs. Researchers might even be able 
to test bolder biological hypotheses, particularly in cancers for which 
current therapeutic options are poor. With these tools in hand, and 
developing rapidly, the challenge now becomes to determine how we 
can implement these strategies in the real world. 

Master-protocol clinical trials that use umbrella and basket designs 
to enable trial stages to be run in parallel are efficient. However, the 
subdivision of tumour and therapeutic pairs that they create highlights
a need for more innovative solutions and approaches, particularly
in early drug development. For example, there might not be
enough patients to test the targeted therapeutic using conventional 
designs. Figure 6 shows a suggested strategy for the development 
of therapeutic agents to treat cancer with an overall incidence of 
10 patients per 100,000 individuals per year. Supportive evidence 
for a particular strategy can be classified according to an
‘actionability index’. The development of each therapeutic agent will progress
within this framework or graduate to pivotal studies when there is 
sufficient evidence. 


Accelerating stratified therapeutic development 

The development of precision therapeutics focuses on leveraging the 
science, however, many important challenges pivot on operational 
components”. These components require the integration of multiple 
complex processes such as participant screening and recruitment 


BOX1 


A number of diagnostic, protocol and operational requirements 
must be considered when designing clinical trials that use multidrug 
portfolios. 


@ Participant screening and recruitment There should be a viable 
means by which to identify low-incidence patient subpopulations 

and to direct individuals to an appropriate clinical trial. Patient- 

centric approaches give individuals access to many options through 

a single screening process. Such screening programmes are usually 
region-wide and collaborative. They can be linked to umbrella and 
basket studies and also to global studies that accept participants 

from diverse screening routes. Drug portfolios are made available to 
these trials through collaborations, and safeguards are implemented 
for proprietary information when multiple partners are involved. The 
multiplexed diagnostic platforms and systems should be harmonized 
or cross-validated to allow patients to be recruited irrespective of 

the technology used by partners. Regulators should be open to 
changes with respect to how these clinical trials are run. The screening 
programmes are underpinned by networks, collaborations and 
reliable partners. 

@ Molecular testing The testing platform and screening or selection 
algorithm should enable broad yet robust tumour and patient profiling. 


Delivering multidrug-portfolio studies 


They should provide viable drug-development routes for larger or 
global studies, regulatory interactions and markets. Samples must 

be used efficiently and data generation should be robust. Overall, 
molecular tests should be cost-effective, transferable and widely 
deployable. Testing should be performed to agreed standards. 

@ Protocols Trials should start with a flexible protocol that can 
incorporate both emerging changes in the science and an 
understanding of patient and tumour biomarkers. Alternatively, they 
could use a confirmatory development protocol that permits regulatory 
interactions that accept different types of data. Such protocols can be 
deployed on their own or in alignment with other protocols. They can 
be modular, rolling or open ended, and must be reviewed efficiently 
according to a centralized regulatory and ethics process. 

@ Availability and delivery of therapies Operational machinery must 
be chosen that allows clinical studies to be conducted in diverse groups 
of patients and over a broad geographical area. Regulatory and ethics 
processes and patient screening and recruitment should be aligned 
and efficient. Therapies can be distributed using hub-and-spoke 
models and cost-effective and efficient delivery of multiple candidate 
drugs to multiple sites can be facilitated through a centralized 
pharmacy. The work should be highly collaborative, spread across 
many groups and involve reliable partners. 
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Figure 5 | Early stratified therapeutic development. An important element 
of early therapeutic development is the use of small trials that are underpinned 
by a deep understanding of tumour molecular pathology, which guides 
ongoing trial development. A stepwise development approach is applied, and 


and the molecular testing of tumours. Rather than pursuing the con- 
ventional goal of finding the patient for the trial, the overall goal is 
to ‘find the trial for the patient’. Protocols must also be flexible and 
therapies must be available and deliverable (Box 1). 


Participant screening and recruitment 

The realities of the conventional screening approach in clinical drug 
development are sobering. For example, consider a candidate-drug 
trial in a subpopulation of patients that were selected by a biomarker 
with a 2% incidence, which has a typical screening failure rate of 15% 
and a patient dropout rate of 15%. The trial would need to screen 
78 patients to find one patient for recruitment, which effectively 
means that 77 patients are discarded. The cost of such an approach is 
equally sobering. Screening using routine single-variable diagnostic 
approaches, such as immunohistochemistry or a single-gene DNA 
test, would have a cost of about US$1,125 per assay, which includes 
performing and processing the assay, as well as logistics and report- 
ing. It would therefore cost $88,235 to screen enough individuals to 
recruit one participant. To conduct a 20-patient phase I expansion 
study in this selected patient subpopulation, the trial would need to 
screen 1,560 patients, at a cost of $1.8 million. 
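
The cost figures above follow from simple multiplication, which can be checked with a short sketch. The Python below reuses only the numbers quoted in the text (US$1,125 per assay, 78 patients screened per recruit, a 20-patient cohort). The variable and function names are illustrative, and the step from the 2% incidence and the 15% failure and dropout rates to the figure of 78 is taken as given rather than re-derived.

```python
# Figures quoted in the text; names are illustrative, not from any trial system.
COST_PER_ASSAY_USD = 1_125      # single-variable assay, incl. logistics and reporting
SCREENED_PER_RECRUIT = 78       # patients screened to recruit one participant
COST_PER_RECRUIT_USD = 88_235   # quoted cost of recruiting one participant
EXPANSION_COHORT = 20           # phase I expansion study size


def screening_cost(patients_screened: int,
                   cost_per_assay: float = COST_PER_ASSAY_USD) -> float:
    """Total assay cost for a given number of screened patients."""
    return patients_screened * cost_per_assay


# Number of assays implied by the quoted per-recruit cost (about 78.4).
implied_screened = COST_PER_RECRUIT_USD / COST_PER_ASSAY_USD

# Scaling up to the 20-patient expansion cohort.
cohort_screened = EXPANSION_COHORT * SCREENED_PER_RECRUIT
cohort_cost = screening_cost(cohort_screened)

print(f"Implied patients screened per recruit: {implied_screened:.1f}")
print(f"Patients screened for cohort: {cohort_screened:,}")
print(f"Screening cost for cohort: US${cohort_cost:,.0f}")
```

Rounded, the cohort total agrees with the $1.8 million quoted above.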

In addition, the patient’s experience during the conventional 
screening approach is often extremely poor and can involve many 
cycles of disappointment. After first being considered for a trial, the 
patient might then become ineligible to participate if they do not have 
the correct biomarker. They must then undergo repeat biopsies during 
the search for the next biomarker, and ultimately might receive 
only limited drug options. The physician’s experience is similarly 
poor: his or her options are limited to screening for different 
biomarkers, and associated trials, so long as tumour material is available. 
From the operational viewpoint of a clinical trial, this is unsustainable 
for practical reasons, such as the lack of available tissue and the 
unwillingness of patients and clinicians to participate. 

Figure 5 (caption continued) | interim analyses, trial-population expansions 
and molecular assessments are implemented at specific points. CR, complete 
response, and PR, partial response, based on Response Evaluation Criteria in 
Solid Tumors (RECIST) 1.0 criteria. 

The need to find sufficient numbers of patients with a specific 
biomarker has generated many cooperative study groups (Table 1). 
Consortia provide multiplexed molecular testing assays, in which many 
biomarkers are measured concurrently, as part of the drug-development 
process, as well as programmes that offer ‘self-tested’ patients 
access to appropriate therapy either as part of clinical trials or through 
‘off-label’ treatment. In the United States, examples include 
national-level, cross-sector collaborative (including government-based) 
initiatives such as the National Cancer Institute-Molecular Analysis for 
Therapy Choice (NCI-MATCH) (solid tumours) and Lung Cancer 
Master Protocol (Lung-MAP, NCT number NCT02154490) 
(squamous lung cancer) programmes. Other examples include the 
Screening Patients for Efficient Clinical Trial Access (SPECTA) 
programmes (SPECTAColor in colorectal cancer (NCT01723969) and 
SPECTALung in lung cancer (NCT02214134)) and the AURORA 
initiative in Europe (breast cancer (NCT02102165)), and the Lung 
Cancer Genomic Screening Project for Individualized Medicine in 
Japan (LC-SCRUM-Japan). Cancer-specific advocacy and charity 
organizations also lead cooperative study groups, such as the ‘Know 
Your Tumor’ programme established by the Pancreatic Cancer Action 
Network in the United States. Although these models are advancing 
precision oncology, they are costly because they require intermediaries 
to navigate the patient through the health-care system. They 
are also difficult to scale up without fundamental changes in 
health-service delivery. Meanwhile, patients and clinicians are also driving 
forwards new approaches. These approaches include clinical trials 
and other therapeutic options as part of a molecular assay report, 
such as FoundationOne from Foundation Medicine, and connections 
to further information, consumer-focused advice, communities 
and patient-led consortia. The broader net that such approaches cast 
helps to identify smaller and smaller subtypes and opportunities for 
individual patients. Strategies that provide genomic health advice 
and navigation, such as Perthera, are also gaining traction. Others 
have begun to use electronic media to enable patients and clinicians 
to ‘shop around’ for the best option. These strategies can markedly 
improve efficiency and the patient experience. However, despite these 
efforts, trials using a selection biomarker still constitute only a 
minority of current studies. 

Figure 6 | Clinical-testing strategies. Lower-prevalence segments present 
a considerable challenge when testing stratified therapeutic strategies. It is 
also a challenge to determine the level of evidence that is required to embark 
on later-phase studies. The potential approach shown in this matrix is a 
function of the existing level of evidence, the prevalence of the segment, 
which indicates the feasibility of the testing strategy, and current regulatory 
requirements. Trials can progress as the level of evidence increases, and this 
progression can be built into the planned stepwise development process. 

Recruiting eligible patients onto a clinical trial represents a major 
challenge. If the prevalence of eligible patients is low, it is often nec- 
essary to open a large number of screening centres — a consider- 
able cost, especially since not all will be able to recruit patients. As 
screening programmes expand in size, the cost of funding the search 
for patients shifts from drug developers to health-care systems or 
research platforms. A possible solution is to open clinical trials at 
a location that is accessible to the patient only after they have been 
identified — known as ‘just-in-time’ accessibility. The cost of rapidly 
deploying teams to establish a trial location after a patient has been 
found is likely to be lower than the cost of screening a large number 
of patients. 


Molecular testing 

Although multiplex testing of the coding regions of candidate genes 
offers some options, the complexity of cancer will inevitably require 
more in-depth analyses. The challenges of delivering molecular 
assays using advanced technologies are discussed elsewhere; however, 
current tests exploit the relatively direct relationships that exist 
between a specific mutation and the efficacy of a drug. The appraisal 
and delivery of more complex assays that might better identify 
responsive subtypes” is proving to be difficult despite advances 
in clinical-grade diagnostics. This is mainly due to the rigidity and 
inertia of established processes for biospecimen handling. Simple 
solutions such as liquid biopsies are promising, but could lack 
broad applicability, particularly when complex molecular changes 
must be analysed. Technology considerations aside, it is more impor- 
tant to understand the relevance of any detected changes or muta- 
tions, and the body of evidence that is required to substantiate their 
use for patient selection. Modern multiplex systems such as next- 
generation sequencing technologies reveal the molecular changes 
within a single tumour at an unprecedented level of detail. Many of 
these changes will not have been widely reported: some are likely to 
be specific to that tumour (or tumour region) and there will be little 
previous clinical experience or knowledge for most. In light of this, 
how should therapeutic selection be informed? Although specific 
mutations in a particular gene can confer sensitivity to a particu- 
lar therapeutic agent, what should we do if we discover previously 
unreported mutations in that same gene? And what should we do if 
the potential functional consequences have not been investigated 
yet? Can these mutations reasonably be expected to confer similar 
therapeutic sensitivity? This challenge is being addressed through 
trial design and the diagnostic algorithms that are used to assign 
patients to treatments. We must be careful to avoid reporting a study 
as negative purely because it has not shown any clinical benefit in 
a subpopulation that has been defined by mutations of unknown 
consequence. Not all mutations in a gene will be predictive of clinical 
benefit. Practical solutions to accommodate such uncertainty often 
combine adaptations within umbrella- or basket-shaped trial arms 
that can examine combinations of biomarkers and therapeutics in 
isolation. Different weightings can then be attributed to mutations 
of known and unknown clinical or functional consequence — a pro- 
cess called mutation tiering in which groups are designated as either 
‘tight’ markers that have a high level of supportive evidence or ‘loose’ 
markers that are more exploratory in nature. 
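
Mutation tiering can be pictured as a simple labelling step applied before patients are assigned to trial arms. The sketch below is illustrative only: the `Mutation` record, the evidence flag and the numeric weights are assumptions made for the example and are not taken from any trial protocol.

```python
from dataclasses import dataclass

# Hypothetical weights: the text describes two tiers but gives no numeric values.
TIER_WEIGHTS = {"tight": 1.0, "loose": 0.5}


@dataclass(frozen=True)
class Mutation:
    gene: str
    variant: str
    has_supporting_evidence: bool  # known clinical or functional consequence


def assign_tier(mutation: Mutation) -> str:
    """'tight' if there is supportive evidence, otherwise exploratory 'loose'."""
    return "tight" if mutation.has_supporting_evidence else "loose"


def tier_weight(mutation: Mutation) -> float:
    """Weight given to the mutation when assigning a patient to a trial arm."""
    return TIER_WEIGHTS[assign_tier(mutation)]


# Placeholder entries: two variants in the same gene, one previously reported.
known = Mutation("GENE_A", "reported_variant", True)
novel = Mutation("GENE_A", "unreported_variant", False)

tiers = {m.variant: (assign_tier(m), tier_weight(m)) for m in (known, novel)}
for variant, (tier, weight) in tiers.items():
    print(f"{variant}: tier={tier}, weight={weight}")
```

In a real design the evidence flag would be replaced by a curated evidence level, and the weights would be fixed in the protocol rather than hard-coded.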


Protocol flexibility 

The administrative and logistical challenges of clinical trials are 
substantial. They impede the ability to respond nimbly to trial find- 
ings, particularly if unexpected, or to data emerging from outside the 
trial. Establishing frameworks and platforms for stratified therapeutics 
development will facilitate the deployment of ‘within-protocol’ 
responses to specific scenarios, which will improve the flexibility of trials 
(Box 1). 


Availability and delivery of therapeutics 

Conducting molecular analysis without the prospect of a resulting 
action is of little value. There are comparatively few opportunities 
in routine health care in which multiplexed testing can be applied 
to influence clinical decision-making, and access to appropriate 
therapeutics remains problematic. Negotiating individual clinical 
trials on an ad hoc basis is impractical because of slow legal and 
administrative processes; a closer relationship must be cultivated 
between the pharmaceutical industry and other stakeholders to ease 
this roadblock. The involvement of multiple pharmaceutical partners 
will ensure that a broader range of candidate drugs and appropriate 
comparator therapies are available. Wider collaboration between 
tumour-specific consortia, diagnostic and regulatory groups, as well 
as major charities and other interested parties, will also be pivotal. 
A drug-portfolio approach, negotiated as a broad partnership or 
through a consortium strategy, is a necessity, as is the ability to 
deliver therapeutic agents through systems such as a centralized 
pharmacy. The ability to offer patients and clinicians a broad selection of 
attractive treatment options will enhance participation in clinical 
trials. At present, only 2–5% of potentially eligible participants 
enrol in such trials. Initiatives such as NCI-MATCH, Lung-MAP, 
and the Cancer Research UK Stratified Medicine programmes and 
the National Lung Matrix Trial (Lung MATRIX) (European Clinical 
Trials Database (EudraCT) number 2014-000814-73) have set a 
precedent for prioritizing participation rates. However, the real value 
to the patient and health-care system will be realized when these strategies 
become commonplace and encompass a greater proportion of 
drug-development portfolios. This will ensure the broad availability of 
therapeutics currently in development. 

Most advances have been achieved by altering drug-development 
strategies to fit into established health-care systems. Consequently, 
progress has been slow. If health-care systems are out of pace with 
the drug-development process, they could be impeding the develop- 
ment of therapeutic agents. Health-care systems that can implement 
precision medicine will greatly facilitate therapeutic development. 
To accelerate progress, health-care systems must be aligned to ensure 
that they are able to test and deliver precision medicine without the 
need for costly overlying clinical-trial infrastructure. 


Future directions 

In recent years, our understanding of the precision-therapeutic devel- 
opment pathway has evolved rapidly. In some areas, targeted-drug 
development has progressed from concept to reality. The frameworks, 
platforms and processes involved are now capable of supporting 
modern oncology drug development. Innovative clinical-trial designs 
— also a central component of development — are highlighting the 
need to better appraise tumour biology, drug efficacy and the poten- 
tial benefits for patients. Emerging drug-development paradigms 
are driving new ways of working collaboratively to accelerate pro- 
gress. By generating truly patient-centric clinical trials, we have taken 
important early steps into the evolving era of precision medicine. In 
some cases, these steps are already enabling us to ‘select’ the trial for 
the patient. However, major hurdles remain, and we must establish 
broad frameworks and systems that integrate closely with health- 
care delivery to accelerate progress and realize the true promise of 
precision medicine. 

Progress and challenges in probing the human brain 

Russell A. Poldrack & Martha J. Farah 

Perhaps one of the greatest scientific challenges is to understand the human brain. Here we review current methods in 
human neuroscience, highlighting the ways that they have been used to study the neural bases of the human mind. We 
begin with a consideration of different levels of description relevant to human neuroscience, from molecules to 
large-scale networks, and then review the methods that probe these levels and the ability of these methods to test 
hypotheses about causal mechanisms. Functional MRI is considered in particular detail, as it has been responsible for 
much of the recent growth of human neuroscience research. We briefly review its inferential strengths and weaknesses 
and present examples of new analytic approaches that allow inferences beyond simple localization of psychological 
processes. Finally, we review the prospects for real-world applications and new scientific challenges for human 
neuroscience. 


The way that we conceptualize brain function has always been 
constrained by the methods available to study it. Studies of 
patients with focal brain lesions in the nineteenth century led 
to the view of the brain as a collection of focal centres specialized for 
particular cognitive abilities, such as ‘Broca’s area’ for speech production. 
The development of neurophysiological recording techniques in the 
twentieth century led to Barlow’s ‘neuron doctrine’, according to which 
the functions of individual neurons can be extrapolated to explain the 
function of the brain as a whole. The cognitive neuroimaging studies of 
the 1980s focused on subtractive comparisons between cognitive tasks 
meant to isolate specific cognitive operations, and led to a relatively 
modular view of brain function as involving localized and separable 
regions that implement elementary mental operations. 

The methods of contemporary human neuroscience have provided a 
much more complex and nuanced view of the human brain as a dynamic 
network with multiple levels of organization, in which function is 
characterized by a balance of regional specialization and network 
integration. Although current methods are limited in their utility for 
studying brain function at fine-grained levels of organization (such as 
single neurons or cortical columns), human neuroscience has nonetheless 
made remarkable progress in understanding basic aspects of functional 
organization, and with this have come a number of applications to 
address real-world problems. Our goal here is to review the current state 
of human neuroscience, focusing on what kinds of questions can and 
cannot be answered using current techniques and how those answers are 
relevant to real-world applications. 


How can we study the human brain? 

Methods for studying human brain function can be organized according 
to the kinds of mechanistic insights that each technique provides. As 
shown in Table 1, the first characteristic is the level of mechanism 
captured by the method. Mechanisms range from the molecular level 
(neurotransmitters and receptors) to large-scale networks (the dynamic 
integration and coordination of different functional areas of the brain). 
Although this distinction is related to physical scale, it does not depend 
on the method’s spatial resolution per se. For example, positron 
emission tomography (PET) using neurotransmitter ligands measures 
molecular mechanisms, even though its spatial resolution is on the order 
of one centimetre. The second characteristic is the ability of each method 
to elucidate the mechanistic role of an observed brain molecule, cell, 
region or network in a mental function of interest. By mechanism we 
mean the causal chain of events that results in the realization of a 
function. To fully understand human brain function is to know the causal 
chains of events at the molecular, cellular, population, and network 
levels that give rise to psychological function. For this reason, the power 
to identify causal relationships is a crucial dimension of difference 
among methods. 

Some methods used in the study of human brain function provide 
relatively little insight into causal mechanisms. This includes methods that 
exploit naturally occurring variation by observing the strength of asso- 
ciation between individual differences in brain function and behaviour. 
Analyses of relationships between behavioural traits, genes, brain structure, 
and brain function exemplify this approach (see Box 1 for a discussion of 
genomic approaches). For many important psychological phenomena, 
from effects of life history to personality traits, we are limited to 
observational methods. For example, individual differences in the personality 
trait of impulsiveness have been associated with differences in striatal 
dopamine release, functional MRI (fMRI) activation, and cortical grey 
matter volume. Observed associations between neural and psychological 
traits do not necessarily imply a causal relationship, as these associations 
could result from an unmeasured third variable that independently influ- 
ences the two measures. Nevertheless, such associations provide a valu- 
able starting point for theorizing about the neural mechanisms of human 
psychology, and their evidentiary value can be strengthened by measuring 
possible confounds to rule them in or out. 
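
The third-variable problem can be illustrated with a small simulation. In the sketch below, which assumes NumPy is available, a hypothetical unmeasured variable drives both a neural measure and a psychological trait. The two are correlated even though neither influences the other, and the association largely disappears once the confound is measured and regressed out. All variable names and effect sizes are invented for illustration.

```python
import numpy as np


def residualize(values: np.ndarray, confound: np.ndarray) -> np.ndarray:
    """Remove the least-squares linear fit of `confound` (plus intercept)."""
    design = np.column_stack([np.ones_like(confound), confound])
    beta, *_ = np.linalg.lstsq(design, values, rcond=None)
    return values - design @ beta


rng = np.random.default_rng(0)
n_subjects = 5000

# Hypothetical third variable that independently influences both measures.
third_variable = rng.standard_normal(n_subjects)
neural_measure = third_variable + rng.standard_normal(n_subjects)
psych_trait = third_variable + rng.standard_normal(n_subjects)

# Raw association between the neural and psychological measures.
raw_r = np.corrcoef(neural_measure, psych_trait)[0, 1]

# Partial correlation after regressing the measured confound out of both.
partial_r = np.corrcoef(
    residualize(neural_measure, third_variable),
    residualize(psych_trait, third_variable),
)[0, 1]

print(f"raw correlation:     {raw_r:.2f}")
print(f"partial correlation: {partial_r:.2f}")
```

Regressing out a confound only helps for variables that were actually measured, which is why the observational methods discussed here remain weak evidence for causal mechanism.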

Although functional neuroimaging, electroencephalography/magnetoencephalography 
(EEG/MEG) and single-cell recordings are sometimes 
criticized as being purely correlative and therefore uninformative 
about mechanism, that criticism is only partly accurate. When psycho- 
logical processes are experimentally manipulated by presenting a certain 
kind of stimulus and/or engaging the subject in a task, we can infer 
that any reliably elicited brain activity was caused by performing these 
psychological functions. We cannot, however, infer with confidence 
that the observed brain activity is causally responsible for the 
psychological process under study. Despite this limitation (which is shared by 
neuronal recordings in non-human animals), neuroimaging studies 
in which psychological processes are manipulated comprise the 
majority of current human neuroscience research, and have advanced 
our understanding of human brain function, as we will discuss in more 
detail below. 

Table 1 | An overview of the levels of analysis and levels of causal inference afforded by different human neuroscience methods 

Entries are grouped by strength of causal evidence and then by level of mechanism (molecules, cells, populations, networks). 

Purely observational (associations do not necessarily imply causal relations between mind and brain) 
  Molecules: Genetic associations with behaviour, brain function or brain structure; postmortem studies of gene expression correlated with psychological traits; correlations of MRI spectroscopy or PET ligand imaging with psychological traits 
  Populations: Structural morphometry correlated with psychological traits 
  Networks: Resting functional connectivity (fMRI, EEG/MEG) or structural connectivity (sMRI, DTI) 

Manipulate psychological process and observe brain (neural measures may be epiphenomenal) 
  Molecules: Task modulation studies using PET with neurotransmitter ligands or MRI spectroscopy 
  Cells: Intracerebral recording in surgical patients 
  Populations: Task activation studies (PET, fMRI, EEG/MEG); representational analysis (fMRI, EEG/MEG); computational neuroimaging (fMRI, EEG/MEG) 
  Networks: Task-based functional connectivity (fMRI, EEG/MEG) 

Manipulate brain and observe psychological results (demonstrates causal effect of neural system in behaviour) 
  Molecules: Pharmacological manipulation (including hormones) 
  Cells: Direct brain stimulation in surgical patients 
  Populations: Focal cortical lesions; transcranial magnetic stimulation; transcranial electrical stimulation; cortical surface electrode stimulation in surgical patients 
  Networks: Disconnection/white matter lesions 

DTI, diffusion tensor imaging; EEG/MEG, electroencephalography/magnetoencephalography; fMRI, functional MRI; MRI, magnetic resonance imaging; PET, positron emission tomography; sMRI, structural MRI. 


BOX 1 
Challenges of merging neuroimaging and genomics 

The substantial heritability of many psychological functions has driven 
great interest in finding genetic underpinnings of individual 
differences in neural function. Twin and family studies have 
demonstrated significant heritability for both task-related BOLD 
responses and resting-state functional connectivity in fMRI. In the 
past decade, a large number of studies have also reported associations 
between BOLD signals and common variants in candidate genes. 
Unfortunately, this approach has generally been unsuccessful in 
identifying genetic associations that are replicated in genome-wide 
association studies (GWAS). For example, a striking finding from the 
first well-powered GWAS of genetic variants associated with brain 
volume was that none of the associations previously identified through 
candidate gene studies were replicated at the genome-wide level. 
Similarly, candidate gene associations with cognitive function (such as 
the association between polymorphisms in the COMT gene and 
working memory) and brain activation have generally not been 
confirmed in meta-analyses, and are subject to a substantial degree of 
publication bias. As in many other areas of genetics, this 
suggests that genome-wide approaches are the most likely to lead to 
reliable identification of common variants related to brain structure 
and function. However, GWAS approaches require large samples (in 
the tens of thousands), which are very difficult to amass for task-based 
fMRI studies; for that reason, GWAS-based approaches to probing the 
human brain will likely be limited to resting-state fMRI and structural 
MRI. Other strategies, such as targeted studies investigating rare 
variants of large effect identified using genome sequencing, or studies 
using gene expression in peripheral tissues, may have greater utility for 
genetic studies of task-based fMRI. Task-based fMRI may also be used 
to further investigate candidate variants identified on the basis of 
GWAS of psychiatric disorders or population variability. 

More decisive evidence concerning causal necessity can be obtained 
by manipulating the brain itself to assess the resulting effect on the 
psychological process in question. Naturally occurring or surgical 
lesions, which provided the basis for most of what we knew about 
human brain function before the advent of neuroimaging, are still of 
great interest because they provide insight into the causal necessity of 
specific brain regions or connections. More recently developed methods 
of brain stimulation allow for reversible inhibition or excitation of a 
brain area, thereby expanding our ability to test the causal role of brain 
regions in the mechanisms of human thought and action. Deep brain 
stimulation (DBS) provides the most precise method for targeted stimu- 
lation by using surgically implanted electrodes, but is limited to situa- 
tions where patients are undergoing implantation for medical reasons. 
Use of non-invasive brain stimulation for research purposes has grown 
rapidly in recent decades, starting with transcranial magnetic stimu- 
lation (TMS), in which pulsed magnetic fields induce currents in the 
brain. Various forms of transcranial electric stimulation (TES), in which 
current is delivered using external electrodes, have also been used, of 
which the most common variant is transcranial direct current stimu- 
lation (tDCS). Unlike DBS, non-invasive brain stimulation generally 
affects larger and more superficial areas of the brain, but researchers 
are seeking to improve spatial resolution with new magnetic coil shapes 
for TMS and new electrode configurations for tDCS. Focused ultra- 
sound is also being explored as a means to stimulate more precisely 
delimited brain regions’. Pharmacological agonists and antagonists of 
particular neurotransmitter systems can be used to experimentally 
manipulate the human brain at the molecular level, although with 
imperfect specificity’. By combining each of these manipulations of 
brain function with functional brain imaging, one can leverage the cau- 
sal information obtained through pharmacological challenges or brain 
stimulation. For example, the causal role of activity in specific brain 
regions, identified using fMRI, for a particular function has been tested 
by brain stimulation, using both direct cortical stimulation (for example, 
ref. 6) and TMS’. 


New capabilities of fMRI 


Because fMRI has become the main method for the study of human 
brain function, our review focuses on this method and new ways of using 
it. In the last two decades, fMRI has transitioned from a newly developed 
technique for revealing neuronal activity to being the workhorse method 
of cognitive neuroscience (see the recent special issue of Neuroimage on 
the first 20 years of fMRI). Much has been learned about the biological 
mechanisms underlying blood oxygen level dependent (BOLD) 
signals, but much remains to be understood, such as the roles of 
specific glial and neuronal cell types in the coupling of neuronal activity 
to blood flow (for example, refs 11, 12). This limited physiological 
understanding poses problems for the interpretation of fMRI data. In 
particular, although fMRI signals often correlate strongly with both 
action potentials (‘spikes’) and local field potentials, they largely 
reflect post-synaptic processes, and in some cases they can be 
dissociated from spiking altogether. The relative sensitivity of fMRI 
to post-synaptic processes as opposed to spiking has been seen as a 
drawback by some who view spikes as the essence of brain function, 
but it is worth noting that this discovery has actually rekindled interest 
in the analysis of local field potentials in electrophysiology (where these 
signals have long been discarded) (for example, ref. 14), and suggests 
that fMRI may sometimes be sensitive to subthreshold signals that 
would be missed by analysis of spikes only. Uncertainties in relating 
fMRI to psychological, as well as physiological, processes have also been 
debated, and progress has been made on this front too. From 
experimental approaches such as adaptation paradigms for probing 
representations to analyses of functional connectivity, fMRI is routinely used to 
answer questions about mind-brain relationships that go far beyond 
localization. Here we discuss three examples of new approaches to 
understanding human brain function with fMRI that address questions 
of representation, computational processes and network interactions 
across the brain. 


Representational analyses 

Early work in neuroimaging focused largely on ‘brain mapping’: 
identifying regions based on the mental processes that cause them to 
be activated. This approach has provided a large body of reliable 
associations between function and structure, but has not been particularly 
successful in providing new insights into how psychological functions 
are implemented. However, two relatively recent approaches, known 
as multi-voxel pattern analysis (MVPA) and representational 
similarity analysis (RSA), can more directly relate psychological contents to 
brain function (Fig. 1). MVPA involves the use of methods from the 
field of machine learning to decode or predict psychological states 
from patterns of brain activation across voxels (hence the term 
‘brain-reading’). Since its introduction more than a decade ago, MVPA 
has been used in a number of domains to demonstrate the predictive 
ability of fMRI activation patterns. Perhaps the most impressive are 
demonstrations of the ability to successfully reconstruct visual scenes 
and faces from BOLD activity patterns; similar advances have been 
made for higher cognitive functions such as word meaning. These 
studies go beyond simply differentiating between experimental 
conditions, as they show how the underlying representational spaces relate to 
brain activity; for example, using a related approach known as 
voxel-wise modelling, Huth and colleagues developed a model that 
estimated the response at each location on the cortical surface to a large 
number of visual and semantic features present in natural movies 
(Fig. 2). MVPA approaches have also provided new insights into the 
neural organization of cognitive functions. For example, MVPA has 
informed our understanding of the mechanisms of visual attention, by 
showing that attention changes both the representation of stimuli 
across regions of visual cortex and the mutual information 
between regions. In the domain of memory, MVPA has been used 
to show that competition between memory representations in working 
memory leads to poorer subsequent memory for those items, 
demonstrating a nonmonotonic relationship between competition and 
subsequent memory. 
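
The decoding idea behind MVPA can be sketched on simulated data. The example below assumes NumPy and invents everything it uses: two stimulus categories, nine voxels, fixed category patterns and Gaussian noise. It uses a nearest-centroid classifier with leave-one-out cross-validation purely for brevity; published MVPA studies use a range of classifiers and validation schemes, so this is a toy illustration and not the method of any cited study.

```python
import numpy as np

rng = np.random.default_rng(0)
n_trials = 40  # simulated trials per category

# Hypothetical mean activity patterns across nine voxels for two categories.
bird = np.array([1, 1, 1, 0, 0, 0, -1, -1, -1], dtype=float)
furniture = -bird

# Each trial is the category pattern plus independent voxel noise.
X = np.vstack([
    bird + 0.5 * rng.standard_normal((n_trials, bird.size)),
    furniture + 0.5 * rng.standard_normal((n_trials, bird.size)),
])
y = np.repeat([0, 1], n_trials)  # 0 = bird, 1 = furniture


def predict_nearest_centroid(train_X, train_y, test_x):
    """Assign the test pattern to the category with the closest mean pattern."""
    centroids = np.stack(
        [train_X[train_y == label].mean(axis=0) for label in (0, 1)]
    )
    distances = np.linalg.norm(centroids - test_x, axis=1)
    return int(np.argmin(distances))


def leave_one_out_accuracy(X, y):
    """Train on all trials but one, test on the held-out trial, and repeat."""
    correct = 0
    for i in range(len(y)):
        train = np.arange(len(y)) != i
        correct += int(predict_nearest_centroid(X[train], y[train], X[i]) == y[i])
    return correct / len(y)


accuracy = leave_one_out_accuracy(X, y)
print(f"decoding accuracy: {accuracy:.2f} (chance = 0.50)")
```

Holding out the test trial when estimating the centroids keeps the accuracy estimate from being inflated by testing on training data.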

Whereas MVPA is generally used to decode individual psychological 
states, RSA instead asks how the patterns of brain activity evoked by 
different stimuli are related to one another, and thus provides the 
means to directly address questions of how mental representations 
are implemented in the brain. RSA has enabled the demonstration of 
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Figure 1 | Different approaches to the analysis of {MRI data. This example 
depicts data from a hypothetical study in which four different stimuli were 
presented (two birds and two items of furniture) and response measured for 
each item across nine voxels; intensity of activity is depicted from blue 
(negative) to red (positive). The standard univariate {MRI analysis approach 
would examine the difference at each voxel between the averages of the two 
categories. Multi-voxel pattern analysis (MVPA) examines the 
multidimensional relationship between patterns of activity, in this case 
projecting the nine-dimensional space of voxel patterns (the voxel vector) into a 
two-dimensional space and identifying a boundary that separates items from 
the two classes. Representational similarity analysis (RSA) examines the 
correlations between activity patterns for each item, in this case showing that 
items within category show a high correlation (red), whereas the correlation of 
items between categories is low (blue). 


direct isomorphisms between psychological representations of stimuli 
(such as the similarity or typicality of objects) and the neural patterns 
associated with those stimuli’*’®. Because psychological theories often 
make predictions regarding the similarity of different stimuli, RSA has 
also enabled the direct testing of theories, such as theories about how 
categories are represented’’ and theories of how repeated experiences 
lead to enhanced learning™*. RSA can be applied to any kind of multi- 
dimensional data, and this has enabled the demonstration of systematic 
mappings of visual object representations between humans (using 
fMRI) and non-human primates (using electrophysiological record- 
ings)”’—an example that highlights how human neuroscience can also 
help to establish more direct parallels with findings in non-human 
models, allowing insights to filter in both directions. 
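
The correlational logic of RSA can be sketched in a few lines. This is a minimal illustration only, not the analysis used in the studies cited above. The four stimulus names and the nine voxel values per stimulus are invented to mirror the hypothetical layout of Fig. 1, and `pearson` and `similarity_matrix` are helper names introduced here.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

# Hypothetical response patterns: 4 stimuli x 9 voxels (all values invented).
patterns = {
    "bird_1": [1.0, 0.8, -0.5, 0.9, -0.7, 0.6, -0.9, 0.2, -0.4],
    "bird_2": [0.9, 1.0, -0.4, 0.7, -0.8, 0.5, -1.0, 0.1, -0.3],
    "chair":  [-0.8, -0.6, 0.9, -0.7, 0.8, -0.5, 0.7, 0.3, 0.6],
    "table":  [-0.9, -0.7, 1.0, -0.6, 0.6, -0.4, 0.9, 0.2, 0.5],
}

def similarity_matrix(patterns):
    """Correlate every stimulus pattern with every other one."""
    names = list(patterns)
    matrix = [[pearson(patterns[a], patterns[b]) for b in names] for a in names]
    return names, matrix

names, sim = similarity_matrix(patterns)
for name, row in zip(names, sim):
    print(f"{name:8s}", " ".join(f"{r:+.2f}" for r in row))
```

With these invented values, the within-category pairs correlate strongly and positively and the between-category pairs correlate negatively. A psychological theory that predicts a particular similarity structure could then be compared against such a matrix.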

Although much MVPA and RSA work (as depicted in Fig. 1) has 
focused on the representations found in localized brain regions, these 
methods are equally useful for assessing representations that are spread 
across the brain. For example, recent work has shown that mental states 
such as physical pain can be decoded by analysis of patterns of activation 
across brain regions”. 
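
Decoding in the MVPA sense can be illustrated with a deliberately simple classifier. The sketch below is an assumption-laden toy rather than any published pipeline: it simulates nine-voxel patterns for two categories, then uses a nearest-centroid rule with leave-one-out cross-validation to ask whether the category of a held-out trial can be read from its pattern. The prototypes, noise level, trial count and function names are all invented for illustration.

```python
import random

def centroid(rows):
    """Mean pattern across a list of voxel vectors."""
    return [sum(col) / len(rows) for col in zip(*rows)]

def sq_dist(a, b):
    """Squared Euclidean distance between two voxel vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def leave_one_out_accuracy(samples):
    """samples: list of (label, voxel_pattern); returns fraction decoded correctly."""
    correct = 0
    for i, (true_label, pattern) in enumerate(samples):
        training = samples[:i] + samples[i + 1:]  # hold out trial i
        labels = sorted({lab for lab, _ in training})
        centroids = {
            lab: centroid([p for l, p in training if l == lab]) for lab in labels
        }
        predicted = min(labels, key=lambda lab: sq_dist(pattern, centroids[lab]))
        correct += predicted == true_label
    return correct / len(samples)

# Simulated data: each category has an invented prototype pattern plus noise.
rng = random.Random(0)
prototypes = {
    "bird":      [1, -1, 1, -1, 1, -1, 1, -1, 1],
    "furniture": [-1, 1, -1, 1, -1, 1, -1, 1, -1],
}
samples = [
    (label, [v + rng.gauss(0, 0.5) for v in proto])
    for label, proto in prototypes.items()
    for _ in range(20)
]

accuracy = leave_one_out_accuracy(samples)
print(f"leave-one-out decoding accuracy: {accuracy:.2f}")
```

Testing each trial with a classifier trained only on the other trials is what allows the accuracy to be read as evidence that the patterns carry category information. High accuracy here says nothing about causal mechanisms.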

The legitimate enthusiasm about these methods is tempered by lin- 
gering questions regarding the interpretation of multivariate ana- 
lyses***. In addition, recent work combining electrophysiology and 
fMRI in non-human primates has demonstrated that the sensitivity of 
MVPA is limited by the spatial characteristics of the neuronal represen- 
tations that code for particular features, such that some kinds of neur- 
onal patterns may be more difficult to decode using MVPA than 
others*’. Finally, it is important to stress that, like standard neuroima- 
ging approaches, MVPA and RSA approaches do not inform about 
causal mechanisms. 
Figure 2 | A mapping of high-dimensional semantic space onto the cortical
surface. Here, voxel patterns for 1,705 different action and object categories,
based on brain activity obtained during viewing of natural movies, are mapped
onto the cortical surface image generated using the online browser at
http://gallantlab.org/semanticmovies/. a, Mapping of semantic categories to each
point on the surface; the colours on the surface map correspond to the semantic 
map in panel b. b, A depiction of the semantic space derived from all 
semantically selective voxels. Categories that have similar colours in the 
semantic space are represented in similar patterns of voxels in the brain. Data 
from ref. 22. 


Integrating fMRI and computational modelling

Computational models play a central role in our understanding of both 
cognitive and brain functions and, increasingly, of the relationship 
between the two. By making assumptions explicit, computational mod- 
els enable more direct testing of theories, as well as providing the means 
to link computations at the neuronal level with higher-order functions. 
An example of an area in which substantial progress has been made 
using this approach is reinforcement learning, in which an animal 
selects actions and learns from the rewards gained from those actions. 
Computational models of reinforcement learning (RL) have long played 
a central role in artificial intelligence and psychology, and the discovery 
by Schultz and colleagues** that dopamine neurons appear to signal one 
of the important quantities in these models (reward prediction error) 
has brought these models to the forefront of the neuroscience of decision 
making. For example, a set of publications in 2003 applied RL models to 
neuroimaging data and thereby identified correlates of reward prediction
error signals in dopaminergic target regions such as the ventral
striatum**”°. Subsequent neuroimaging work has established that there 
are multiple RL signals in the brain, some reflecting the simple asso- 
ciation between actions and values (known as ‘model-free’ RL) and 
others reflecting more complex contextual and hierarchical learning 
processes (known as ‘model-based’ RL)*”’*. Similarly, in the study of 
memory, progress has been made in the mapping of medial temporal 
lobe subregions to specific computational operations such as pattern 
completion and pattern separation (for example, ref. 39). In each of 
these domains, the computational interpretation of neuroimaging signals
has been greatly enhanced by parallel studies in non-human animals,
allowing imaging signals to be linked more closely to direct
measures of neuronal activity.
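
To make the notion of a reward prediction error concrete, the following sketch uses a Rescorla-Wagner-style delta rule, which is a simpler stand-in for the models applied in the studies cited above. The learning rate `alpha`, the constant reward schedule and the function name are assumptions chosen for illustration.

```python
def rescorla_wagner(rewards, alpha=0.1, v0=0.0):
    """Track a value estimate and the prediction error on each trial.

    delta = reward - value          (reward prediction error)
    value = value + alpha * delta   (model-free value update)
    """
    value = v0
    values, errors = [], []
    for r in rewards:
        delta = r - value      # how much better or worse than expected
        values.append(value)   # value held before seeing this outcome
        errors.append(delta)
        value += alpha * delta
    return values, errors

# A cue that is always followed by a reward of 1.0 (invented schedule).
rewards = [1.0] * 50
values, errors = rescorla_wagner(rewards, alpha=0.2)

print(f"first-trial prediction error: {errors[0]:.3f}")
print(f"last-trial prediction error:  {errors[-1]:.6f}")
```

The prediction error is largest when the reward is unexpected and shrinks towards zero as the reward comes to be fully predicted. In model-based fMRI analyses, a trial-by-trial error series of this kind is typically used as a regressor against the imaging signal.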


Functional connectivity analysis and resting-state fMRI
Perhaps the most revolutionary development to arise from human neu- 
roimaging research is the realization that the resting brain is far from 
quiescent, and that important insights into brain function can be gained 
from studying the correlated fluctuations of signals across the brain at rest. 
Much of the research into the resting state has focused on a set of regions 
(including anterior and posterior midline regions, lateral temporoparietal 
cortex, and the medial temporal lobe, known as the ‘default mode’ net- 
work") that are consistently less active during performance of difficult 
tasks", and are functionally connected in the resting state*’. Similar pat- 
terns of resting connectivity have been observed in non-human primates* 
and awake rodents“, suggesting that they reflect fundamental principles 
of mammalian brain organization. There is also growing evidence that 
these networks may be important in brain disorders. For example, the 
posterior portion of the default mode network appears to play a critical 
role in the memory deficits observed in Alzheimer disease, showing a 
convergence of amyloid deposition, structural atrophy, and decreased 
metabolic activity”. 
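
Functional connectivity in this sense is, at its simplest, the correlation between the signal time courses of different regions. The sketch below simulates three hypothetical regions, two of which share a common fluctuation, and computes their pairwise correlations. The region names, noise levels and number of time points are invented, and real analyses involve preprocessing steps that are omitted here.

```python
import random
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length time series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

rng = random.Random(1)
n_timepoints = 300

# Regions a and b share a common fluctuation; region c is independent.
shared = [rng.gauss(0, 1) for _ in range(n_timepoints)]
timeseries = {
    "region_a": [s + rng.gauss(0, 0.5) for s in shared],
    "region_b": [s + rng.gauss(0, 0.5) for s in shared],
    "region_c": [rng.gauss(0, 1) for _ in range(n_timepoints)],
}

names = list(timeseries)
connectivity = {
    (a, b): pearson(timeseries[a], timeseries[b]) for a in names for b in names
}

for a in names:
    print(a, " ".join(f"{connectivity[(a, b)]:+.2f}" for b in names))
```

Regions that share a fluctuation appear as strongly connected, whereas the independent region does not. Any signal common to two regions, including an artefactual one, would produce the same pattern.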

Data collected in the resting state can provide insights into the broader 
functional organization of the brain as well. In particular, the organization 
of resting state signals bears a close relation to the organization of brain 
activity evoked by mental tasks. For example, Smith et al.*° used independ- 
ent component analysis to identify spatially independent sets of voxels 
from resting-state fMRI data and from task-based data (obtained from 
the Brainmap meta-analytic database), and demonstrated that the compo- 
nents extracted from resting-state fMRI showed a high degree of concord- 
ance with those extracted from task-based data. The overlap between 
resting-state and task-based functional organization can also be seen 
within individuals; for example, the longitudinal examination of a single 
individual revealed reliable spatial parcellation of activity in the cerebral 
cortex (using resting fMRI data) that mapped systematically to the activation
patterns observed across a large number of task measurements”. 

Despite the substantial excitement around resting-state fMRI findings,
numerous concerns have been raised about their interpretation.
In particular, there are lingering questions regarding the ways in 
which artefacts related to head motion and physiological fluctuations 
may influence estimates of resting state connectivity, and whether 
common data analytic methods may induce systematic artefacts’. 
In addition, potential confounds such as light sleep*° may drive differences
in resting state signals. The unconstrained nature of resting-state
fMRI is a double-edged sword; it is potentially very useful for the
study of clinical groups for whom task performance may be difficult, 
but at the same time, it is not possible to determine whether group 
differences reflect fundamental differences in functional connectivity 
or relative differences in the ongoing mental content of different 
groups during rest (see ref. 51). 


Applications of human neuroscience 

With the development of new methods have come attempts to apply 
them to real-world problems, in both medical and non-medical con- 
texts. (See Box 2 for a discussion of the ethical, legal, and societal issues 
raised by these applications.) 
BOX 2
Ethical, legal and societal impact of 
human neuroscience 


As the methods of human neuroscience find broader application, they 
affect human life in new ways. The field of ‘neuroethics’ is concerned 
with ethical, legal and societal issues raised by these new 
applications®®. 

Two kinds of problems have emerged from the increasing ability of 
brain imaging to reveal aspects of individual psychology: problems 
that arise from the current and imminent capabilities of these 
methods, and problems that arise from their lack of claimed 
capabilities. To the extent that imaging can predict important personal 
characteristics such as health status, academic achievement, and 
criminal behaviour, its use must be managed with care to protect 
privacy and avoid discrimination”. To the extent that imaging cannot 
provide help with high-stake problems, the public should be protected 
from claims that it can. For example, a seemingly ‘scientific’ method 
for detecting lies or diagnosing psychiatric disorders®°°” has a strong 
appeal to the general public who cannot be expected to appreciate the 
gap between what is claimed and what is established fact. 

New ways of changing brain function pharmaceutically, and with 
electromagnetic stimulation, also raise new ethical issues. Of course, 
humanity has long manipulated brain function to modify mental 
states using substances such as alcohol and caffeine. However, 
psychopharmacology has broadly penetrated our everyday lives and 
the scope of psychiatric diagnoses and treatment has expanded—a 
societal shift that some find troubling?®. Furthermore, many now use 
psychoactive drugs purely for enhancement of healthy brain function 
rather than to treat a medical condition®’. Aside from issues of safety 
and efficacy, brain enhancement raises issues of fairness (is it akin to 
doping in sports?), justice (will the ability to access enhancements 
widen the already existing gaps between haves and have-nots?) and 
social standards (will unenhanced job performance become sub- 
standard?). 

Non-invasive brain stimulation is the newest method for brain 
enhancement. Simple transcranial electrical stimulation (for example, 
tDCS) devices are available to consumers at relatively low cost and 
regulation is minimal?©. Given the public interest in this method and 
the rudimentary state of knowledge about its effects, it is crucial that 
the safety and efficacy of these methods are established. The efficacy
of cognitive enhancement with tDCS is hotly debated!°! and whether 
long-term use of tDCS is safe has yet to be studied. In addition, 
neuroethical issues of fairness, justice and social standards mentioned 
above also apply to enhancement of brain function by brain 
stimulation. 


Brain disorders 
The methods of human neuroscience hold particular promise for under- 
standing and treating psychiatric disorders, because these disorders do 
not have clear analogues in non-human animals, and animal models 
currently used for preclinical screening of potential therapies are 
increasingly regarded as being inadequate”. In the absence of valid 
animal models, it becomes all the more crucial to apply new methods 
for understanding human brain function and dysfunction. The goal of 
improving the treatment of neuropsychiatric disorders is made even 
more challenging because of our current diagnostic system. Although 
depression, schizophrenia, autism and other serious psychiatric disor- 
ders have long been considered disorders of the brain, they are still 
diagnosed exclusively by behavioural signs and symptoms. These dia- 
gnostic criteria do not seem to have clear relations to the biological 
processes that would be targeted by new medical treatments. 

In response to this problem, an alternative way of systematizing psychiatric
disorders has been developed: the NIMH Research Domain
Criteria (RDoC)**, which describes disorders according to impairments
in specific functional systems of the brain (such as fear or reward learn- 
ing) and at different levels of mechanism of the kind represented in 
Table 1 (for example, molecules or circuits). RDoC characterizations 
cut across traditional diagnostic categories and are intended to capture 
the underlying pathophysiology more accurately. Given the multiple 
levels of mechanism captured by the RDoC, the system encourages 
research with a broad array of methods to identify potentially targetable 
dysfunctions. 

The application of several human neuroscience methods has led to 
the development of targeted treatments, for example, in the field of 
depression. Functional imaging studies have highlighted the role of 
the subgenual anterior cingulate cortex in a network of regions involved 
in mood, leading Mayberg and colleagues to use deep brain stimulation 
in this area to regulate mood in depressed patients”. Lateral prefrontal 
regions, implicated through imaging studies in depression, have been 
targeted with non-invasive brain stimulation, including the FDA- 
approved use of TMS for treatment-resistant depression. Functional 
neuroimaging can itself be used as a treatment, by providing patients 
with a real-time measure of regional brain activity to use as a biofeed- 
back signal. This approach is being tested for the treatment of chronic 
pain, depression and addiction”. In contrast, neuroimaging has not so 
far been very successful in aiding differential diagnosis of disorders in 
terms of current diagnostic categories. A recent large meta-analysis 
identified a set of regions in which structural abnormalities were con- 
sistently associated with psychiatric disorders, but found very little spe- 
cificity for individual disorders”, consistent with the notion that current 
diagnostic distinctions are not biologically realistic categories. 

Another approach to the discovery of therapeutic targets is the use of 
genetic association studies to identify sets of genes that are associated 
with a disorder and that together may indicate particular molecular 
pathways underlying the disorder. Although the number of subjects
needed to establish reliable genetic associations is daunting, progress has
been made through large international collaborations. For example, the
Psychiatric Genomics Consortium has to date identified more than
100 common genetic variants reliably implicated in schizophrenia”. 
Imaging can also be used to develop endophenotypes (or intermediate 
phenotypes) that may bear a closer relation to the effect of a gene variant 
than does disease diagnosis, as well as to mitigate the problem of het- 
erogeneity within conventional diagnostic categories (see Box 1). 

It may be less surprising that the methods developed for human 
neuroscience research have been applied in the diagnosis and treatment 
of neurological diseases, but at least two recent developments deserve 
mention here. Studies of Alzheimer disease at mechanistic levels from 
molecules to systems have improved diagnostic accuracy and have 
enabled a degree of prediction before clinical signs of the disease*. 
Molecular biomarkers from blood and CSF, and patterns of brain activ- 
ity and structure have revolutionized clinical research in this area by 
facilitating trials of preventive treatment and by providing intermediate 
phenotypes as early gauges of effectiveness. Disorders of consciousness 
following severe brain damage are another area of clinical neuroscience 
for which neuroimaging shows promise. Some patients who have been 
diagnosed as being in the vegetative state can follow commands to 
imagine actions that activate specific areas of the brain in much the 
same way as healthy control subjects do, and can even use these ima- 
gined actions to answer questions (for example, “Do you have any 
brothers? If yes, imagine playing tennis, if no, imagine walking through 
your house.”)*’. Thus, neuroimaging offers new insights into the assess- 
ment of consciousness, as well as the distinct problem of prognosis, in 
severely brain-damaged patients. 


Predicting behaviour 

The ability to predict future behaviour is of value in almost every sphere 
of human activity. Although it has often been said that “the best pre- 
dictor of future behaviour is past behaviour,” in some cases brain 
imaging can improve our ability to predict future behaviour, over and 
above what we can do with behavioural history. Marketing professionals
were among the first to attempt to predict behaviour using brain 
imaging. Recognizing the limitations of focus groups and other tra- 
ditional methods to discern what consumers want, they have used func- 
tional neuroimaging to predict the effects of different advertising 
campaigns, packaging, and other factors on consumer behaviour, based 
on the premise that activity in the brain’s reward or motivation centres 
may be a more direct measurement of wanting than are verbal self- 
reports’. Although most of this work is conducted by and for corpora- 
tions aiming to improve sales rather than share scientific knowledge, 
published academic studies have begun to lend some credence to the 
potential of neuromarketing. For example, when teenage subjects were 
scanned while listening to unfamiliar songs, the reward system activity 
evoked by the songs, but not the subjects’ ratings of their likeability, was 
predictive of sales of the songs over the subsequent three years’. 

Prediction is also important outside of business. Falk and colleagues 
have adapted neuromarketing methods for the purpose of creating 
more effective public service announcements. They showed that brain 
responses (but not ratings) to an anti-smoking advertisement were 
predictive of subsequent call volume to an anti-smoking hotline. 
Gabrieli et al.°° recently summarized evidence concerning neuroima- 
ging-based prediction in domains ranging from healthful eating to 
criminal recidivism, including numerous examples of prediction of 
educational outcomes. Indeed, neuroimaging can predict future aca- 
demic skills over and above traditional behavioural predictors, thus 
enabling earlier and more appropriate interventions to address indi- 
vidual children’s reading and math difficulties. These authors also 
pointed out a number of methodological challenges in neuroima- 
ging-based prediction of behaviour, including the need to develop 
and test predictions with different samples, to avoid the ‘optimism 
bias’ that occurs when predictions are tested in the same population 
from which they were generated. 
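
The optimism bias described above can be demonstrated with pure noise. In the following sketch, which is an invented simulation rather than a reanalysis of any study, 200 random "brain measures" are screened against a random outcome in a small sample. The best-looking measure is then re-tested in an independent sample. The sample sizes, feature count and seed are arbitrary assumptions.

```python
import random
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

rng = random.Random(42)
n_train, n_test, n_features = 20, 1000, 200

def simulate(n):
    """Features and outcome are independent noise: there is nothing to predict."""
    features = [[rng.gauss(0, 1) for _ in range(n)] for _ in range(n_features)]
    outcome = [rng.gauss(0, 1) for _ in range(n)]
    return features, outcome

train_x, train_y = simulate(n_train)
test_x, test_y = simulate(n_test)

# Pick the feature that looks most predictive in the training sample...
best = max(range(n_features), key=lambda j: abs(pearson(train_x[j], train_y)))
in_sample_r = abs(pearson(train_x[best], train_y))

# ...then evaluate that same feature in an independent sample.
out_of_sample_r = abs(pearson(test_x[best], test_y))

print(f"in-sample |r| of selected feature:     {in_sample_r:.2f}")
print(f"out-of-sample |r| of selected feature: {out_of_sample_r:.2f}")
```

Because the feature was selected for its fit to the first sample, its apparent predictive strength there is inflated. Only the independent sample gives an honest estimate, which for pure noise is close to zero.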


Human neuroscience in the courtroom 

In recent years the methods of human neuroscience have found their 
way into the courtroom. Perhaps the most obvious, but also the most 
misunderstood, role for neuroscience is in helping to determine criminal 
responsibility. Proving that a criminal act may have had a neural cause is 
not in itself exculpatory, as every human act is caused by the brain™. 
However, to the extent that neuroscience can provide evidence of mental 
dysfunction (for example, a tumour in the frontal cortex that may have 
impaired the ability to control behaviour), immaturity or other psychological
grounds for reduced criminal responsibility, it is potentially relevant
and has been used. For example, the US Supreme Court explicitly
cited neuroscience evidence in its decision in Graham v. Florida to
abolish life in prison without parole for juveniles who commit non-homicidal
offences. It is more difficult to make legal arguments for
applying neuroimaging evidence to individual cases because most find- 
ings from neuroimaging research are generalizations based on groups of 
people and may therefore not allow reliable inferences regarding indi- 
viduals®. Nevertheless, neuroimaging scans from defendants are some- 
times presented in the sentencing phase of criminal trials as grounds for 
mitigation of the sentence, as weaker evidentiary standards apply in the 
sentencing phase. 

Neuroimaging can be applied in ways other than determining degree 
of responsibility. Lie detection is one example that has been pursued in 
legal contexts, although it has not so far been admitted into US courts 
and has yet to demonstrate validity, reliability or resistance to counter- 
measures outside of the laboratory®*. Another application concerns 
pain: brain-based biomarkers for pain would help discriminate real 
suffering from malingering—a pivotal issue in many lawsuits—and have 
been admitted as evidence in at least one US case". 


Challenges and future directions for neuroimaging 


The field of neuroimaging is growing rapidly, and there are a number of 
exciting new directions on the horizon. 
New technologies for imaging and manipulating the human brain

Rapid advances in non-human neuroscience have been driven by the 
development of technologies that measure and manipulate brain func- 
tion with increasing precision. Human neuroscience has lagged in this 
respect, in part because of the ethical challenges associated with direct 
manipulation and neuronal recording of the human brain. However, in 
response to the urgent need for better treatments for psychiatric dis- 
orders, research is underway with the aim to design implantable systems 
for sensing and modulating human brain networks®. The development 
of optogenetic and ‘opto-fMRI’ approaches in non-human primates” 
suggests that these methods may one day become feasible for use in 
human studies, and it is likely that electrical brain stimulation will 
eventually be supplemented with optogenetic approaches. Although 
such invasive techniques will likely only be used in rare clinical cases 
(that is, in patients who are undergoing implantation for medical reasons),
they have the potential to provide much greater specificity in circuit 
mapping. 

fMRI will probably remain the principal neuroimaging method in 
humans in the foreseeable future. However, the ongoing BRAIN initiat- 
ive in the United States” is providing substantial funding to develop 
entirely new techniques for imaging of brain function, and a significant 
proportion of this funding will go specifically towards the development 
of new methods for imaging the human brain. In addition, new devel- 
opments in MRI have greatly increased the utility of standard MRI 
systems. For example, multiband imaging techniques’”' have enabled a 
several-fold increase in the temporal resolution of fMRI acquisitions, 
and higher MRI field strengths (7 tesla and higher) hold promise 
to enable improvements in spatial resolution as well (for example, 
ref. 72). There is thus great reason to be optimistic that methodological 
limits will continue to be pushed in the future. 

Figure 3 | New methods for characterizing the postmortem human brain.
a, A map of expression of the serotonin receptor 3B displayed on the
reconstructed cortical surface in one individual from the Allen Brain Atlas
Human Brain data set (generated using data from http://human.brain-map.org/).
b, Optical coherence tomography imaging of the human brain
(2.9 in-plane resolution). Large panel presents an average intensity projection
in depth over 300; inset zooms are maximum intensity projections over 300,
showing fibres in the white matter (pink inset), fibres arcing through the
subcortical junction to insert into the cortex (cyan inset), and neurons in the
cortex (bright spots in the green inset). Image courtesy of Bruce Fischl, Caroline
Magnain and David Boas, Massachusetts General Hospital.

Additional insight into human brain function will likely come from
the study of postmortem human brains, which has long been a staple
method for the characterization of anatomical structure and study of
brain disorders. New techniques have enhanced the ability to visualize
the structure of human brain tissue (Fig. 3). For example, optical coherence
tomography has been used to image ex vivo human cortical tissue,
providing high-resolution imaging of cytoarchitecture with less distortion
than standard microscopy techniques”*. The first whole-brain atlas
of genome-wide gene expression in postmortem human brains” has
provided an important resource for understanding how gene expression
relates to brain function; for example, the maps from this project have
been used to identify expression differences across different resting-state
networks”. Continued development of such resources will be essential
for progress in understanding the genetic architecture of brain function
and its relation to mental health disorders.


Connectomics 

The Human Connectome Project’® is nearing completion, and has 
already provided a rich database for the modelling of functional and 
anatomical connectivity of the human brain. However, fundamental 
challenges remain. For example, diffusion MRI provides the means to
track white matter pathways (Fig. 4) and has been used to identify white
matter connectivity disruptions associated with cognitive disorders such
as dyslexia’; however, diffusion imaging has inherent biases that limit
its ability to accurately track connections across the entire brain”. The
last decade has seen a proliferation of approaches to model functional
connectivity on the basis of functional MRI data, though the dust has yet
to settle regarding which methods are most effective (for example,
ref. 80). To determine this, the analysis methods must be validated,
which is challenging to do in humans but may be achieved using direct
measurements of functional connectivity from invasive human
approaches and non-human animals to validate the neuroimaging
results. There is increasing evidence that at least in non-human primates
functional connectivity reflects anatomical connectivity as measured
using either diffusion MRI*! or anatomical tract-tracing™; but it remains
an important challenge to establish the ways in which functional and
diffusion connectivity measures converge or diverge.

Figure 4 | A ‘connectogram’™ for an example healthy adult female subject.
The outermost ring shows the various brain regions arranged by lobe (fr,
frontal; ins, insula; lim, limbic; tem, temporal; par, parietal; occ, occipital; nc,
non-cortical; bs, brain stem; CeB, cerebellum) and further ordered anterior
(top) to posterior (bottom). The colour map of each region is lobe-specific and
maps to the colour of each regional parcellation as determined using FreeSurfer.
The set of five rings (from the outside inward) reflect grey matter volume, area,
thickness, curvature, and connectivity density. The lines inside of the circle


Reproducibility of neuroimaging research 

Large-scale meta-analyses have made it clear that neuroimaging results 
can be highly convergent across studies, to the degree that cognitive 
processes can be accurately inferred from individual subject data using 
decoders trained on meta-analytic data based on reported activation 
coordinates*’. However, the last few years have also seen increasing con- 
cern regarding the reproducibility of research findings in neuroscience, 
paralleling more general concerns about reproducibility of scientific 
results**. These issues are particularly acute for neuroimaging given the 
high dimensionality of the data, relatively low statistical power of many 
studies*’, high degree of analytic flexibility in data analysis procedures*®, 
and potential for questionable research practices such as circular 
analysis procedures*’. The field of neuroimaging has been at the forefront
of a number of developments that aim to improve reproducibility, and the
sharing of data is increasingly being embraced. The Alzheimer’s Disease
Neuroimaging Initiative (ADNI), International Neuroimaging Data 
Sharing Initiative (INDI), ENIGMA, and the Human Connectome 
Project together have shared thousands of neuroimaging data sets and 
this has enabled a number of novel discoveries. For example, data sharing 
by the ENIGMA consortium has enabled the first well-powered genome- 
wide association study of brain volume™, identifying replicated associa- 
tions between brain volume and several common genetic variants. In 
addition, nearly all of the main software packages for neuroimaging data 
analysis are free and open source, providing transparency and repro- 
ducibility in data analysis across groups, and the publication of fully 
reproducible analysis workflows has begun (for example, ref. 89). The 
increasing use of machine learning methods, with their focus on out-of- 
sample generalization rather than statistical significance, is also leading to 
a greater emphasis on achieving reproducibility. 


Outlook 


The use of new tools for imaging and manipulating the brain will con- 
tinue to advance our understanding of how the human brain gives rise to 
thought and action. The combination of myriad methods with different 
and complementary strengths and weaknesses will allow neuroscientists 
to develop a multilevel understanding of the brain, spanning from mole- 
cules to large-scale networks. New analysis methods have advanced 
fMRI beyond ‘blobology’ and will provide direct insight into the map- 
ping of mental and neural representations, while newer analysis and 
acquisition methods will offer other novel insights into the relation of 
mind and brain. fMRI and other human neuroscience methods will 
continue being applied to solve real-world problems, within medicine 
and beyond. Although some of these applications are currently pre- 
mature relative to the demonstrated capabilities of the methods, it is 
clear that the new methods of human neuroscience will have much to 
offer science and society. 
A Cretaceous eutriconodont and
integument evolution in early mammals 


Thomas Martin, Jesus Marugan-Lobon, Romain Vullo, Hugo Martin-Abad, Zhe-Xi Luo & Angela D. Buscalioni


The Mesozoic era (252-66 million years ago), known as the domain of dinosaurs, witnessed a remarkable ecomorphological 
diversity of early mammals. The key mammalian characteristics originated during this period and were prerequisite for 
their evolutionary success after extinction of the non-avian dinosaurs 66 million years ago. Many ecomorphotypes 
familiar to modern mammal fauna evolved independently early in mammalian evolutionary history. Here we report a 
125-million-year-old eutriconodontan mammal from Spain with extraordinary preservation of skin and pelage that 
extends the record of key mammalian integumentary features into the Mesozoic era. The new mammalian specimen 
exhibits such typical mammalian features as pelage, mane, pinna, and a variety of skin structures: keratinous dermal 
scutes, protospines composed of hair-like tubules, and compound follicles with primary and secondary hairs. The skin 
structures of this new Mesozoic mammal encompass the same combination of integumentary features as those evolved 
independently in other crown Mammalia, with similarly broad structural variations as in extant mammals. Soft tissues in 
the thorax and abdomen (alveolar lungs and liver) suggest the presence of a muscular diaphragm. The eutriconodont has 
molariform tooth replacement, ossified Meckel’s cartilage of the middle ear, and specialized xenarthrous articulations of 
posterior dorsal vertebrae, convergent with extant xenarthran mammals, which strengthened the vertebral column for
locomotion.


Mammals have diverse integumentary structures such as hair, spines, and
scutes. Fossilized fur is known for Mesozoic mammals from the Middle
Jurassic period onwards, and can be traced back to the early-divergent
mammaliaform clades. So far, however, fur has fossilized only as hair
impressions and compressions, types of preservation that rarely show
microstructure. The exceptional phosphatized preservation of skin tissues
in the specimen from Las Hoyas extends direct knowledge of mammalian
integumentary microstructure from 60 million years ago (Ma)
(Late Palaeocene epoch) back to 125-127 Ma (Early Cretaceous). The
new Spanish gobiconodontid (Eutriconodonta, Mammalia) combines a 
complete and articulated skeleton, with extraordinary preservation of 
skin, hair, keratinous dermal scutes, and remnants of visceral organs. 

The most striking feature of extant mammalian hairs is their outstanding
polymorphism among closely related species, at different developmental
stages, and even in the same individual. The new fossil
shows prominent polymorphism in a variety of keratinous appendages,
some of which are described for Mesozoic mammals for the first
time. The cylindrical structures (protospines) are formed from the fusion 
of cylindrical micro-tubules, architecturally similar to the spines of mod- 
ern mammals. Such protospines represent the earliest record of a micro- 
structure analogous to that of spiny mice (Acomys) and hedgehogs 
(Erinaceus). As in extant mammals, hair shafts have three architectural 
units: cortex, medulla, and cuticles with varied designs. Moreover, at a 
microscopic scale the epidermal layer preserves compound follicles with 
more than two hairs stemming from the same follicle. The structural 
diversity of hairs in this new mammaliaform indicates that polymorphism
of hairs and related structures is probably correlated with the same
developmental process as in extant mammals.

On its posterior dorsal vertebrae the new eutriconodont has
xenarthrous articulations in addition to the usual zygapophyses. This is
well known for Xenarthra (armadillos, anteaters, and sloths) among
extant placentals, and similar structures have been observed in some
Eocene eutherians. The fact that features of xenarthral articulation
are also present in the Late Jurassic mammaliaform Fruitafossor makes
the new mammal from Spain another case of convergent evolution of
this remarkable skeletal feature that strengthens the dorsal vertebral
column for versatile locomotor functions.


Class Mammalia
Order Eutriconodonta
Family Gobiconodontidae
Spinolestes xenarthrosus gen. et sp. nov. 


Etymology. Spinosus (Latin), in reference to the spiny integument;
λῃστής (Greek) or lestes (Latin spelling), meaning robber and a common
term in taxonomic names of mammals. The specific name xenarthrosus
refers to the special additional (ξένος (Greek), strange) articulation facets
(ἄρθρον (Greek), articulation) of the dorsal vertebrae.

Holotype. MCCMLH30000, Museo de las Ciencias de Castilla-La 
Mancha. A complete skeleton with integumentary structures preserved. 
Skeletal and integumentary remains of the slab MCCMLH30000A were 
transferred to a matrix of epoxy resin in preparation, to expose the 
embedded side of the fossil, but are kept intact in the counter slab 
MCCMLH30000B (Figs 1 and 2a and Extended Data Fig. 1). Life 
Science Identifier: urn:lsid:zoobank.org:act:10B31072-A727-4663- 
A7DC-4224FB97E1E3. 

Locality and horizon. Las Hoyas Quarry, Calizas de la Huérgina
Formation, southwestern Iberian Basin (Cuenca, Spain). Las Hoyas
is latest Barremian (125-127 Ma) in age, on the basis of charophytes
and ostracodes. The Las Hoyas Konservat-Lagerstätte occurs in
finely laminated limestones deriving from a freshwater wetland.
Fossils are usually preserved fully articulated, including soft tissues
such as mineralized muscle and skin. Potential mechanisms for
exquisite preservation are microbial mats, anoxia, and rapid burial by
sediments.

Diagnosis. Dental formula I3-C1-P2-M4/i3-c1-p2-m4. Cheek teeth of
general ‘amphilestid’ pattern (Fig. 1c and Extended Data Fig. 2a-c).
Differs from stem mammaliaforms in lacking the postdentary trough.
Differs from the ‘amphilestids’ Amphilestes and Phascolotherium in
fewer premolars, fewer molariforms, or both; from Hakusanodon
and Juchilestes in fewer molariforms. Differs from Jeholodens and
Yanoconodon by the presence of accessory cusps d and e of lower
molariforms and a lingual cingulum in upper molariforms. Differs from
Liaoconodon in having one fewer lower incisor and one more upper
molariform, and lacking the uniform enlargement of incisors and
canines of the latter. Differs from genera of the Triconodontidae in
absence of vertical tongue-in-groove interlock of molariforms, and more
uniform heights of cusps of the latter. Unique among gobiconodontids
in that the erupting (replacing) molariform is lingual (side-by-side) to
the deciduous molariform of the same tooth locus in the maxilla
(Supplementary Video 1). Within gobiconodontids, Spinolestes differs
from Gobiconodon in having more incisors and fewer premolars. Of
all gobiconodontids, Spinolestes is most similar to Repenomamus in
dentition, but differs from Repenomamus species in having one fewer
molariform each for uppers and for lowers, in enlargement of lower first
incisor, and in having both labial and lingual cingula in upper molariforms
(for full diagnosis and hypotheses of phylogenetic placement see
Extended Data Fig. 3 and Supplementary Information).

Figure 1 | New Early Cretaceous gobiconodontid Spinolestes xenarthrosus.
a, Skeletal reconstruction from the type specimen. b, Drawing of holotype
MCCMLH30000A with skeletal element identification and outline of preserved
integument. Arrow points to skin patch preserving hair bundles, placed
between dorsal vertebrae 11 and 14. Inset: left calcaneus in dorsal view, on the
transferred slab separate from the skeleton. Abbreviations: C, caudal vertebra;
D, dorsal vertebra; l, left; r, right; S, sacral vertebra. c, Reconstruction of
dentition with deciduous molariforms (grey shading). Abbreviations: C/c,
upper/lower canine; I/i, incisor; M/m, molariform; P/p, premolar; D/d,
deciduous tooth; R/r, replacing tooth. d, Reconstruction of right hand.
Numerals designate finger rays. Abbreviations: ra, radius; ul, ulna. e, Dorsal
vertebrae 14-17 with xenarthrous articulations, SEM images (stereo pair).
f, Drawing of dorsal vertebrae 14-17 with accessorial (xenarthrous) lateral
articulations. Abbreviations: az, anterior zygapophysis; alz, anterior lateral
zygapophysis; pz, posterior zygapophysis; plz, posterior lateral zygapophysis.


Description 


The skeleton is exposed on its right lateral side in the main slab after its 
transfer to resin embedding. The skull is exposed on its ventral side 
(Figs 1b and 2a and Extended Data Fig. 1) and has a rounded rostrum. 
The jugals are complete, extending from anterior zygoma to the squamosal
glenoid. The glenoid is transversely expanded. The petrosal has
a cylindrical promontorium, and a narrow lateral trough. The occi- 
pital condyles are large and oval shaped. The mandibles are robust for 
their short length, and have no angular process. The dentary condyles 
are broadly oval and bent laterally from the plane of coronoid process 
and mandibular body. Meckel’s cartilage is ossified, preserved in situ 
on the left mandible but detached on the right side, exposing a wide 
Meckel’s groove on the right mandible (Supplementary Video 2). The 
middle ear bones themselves are lost, but given the curved Meckel’s 
cartilage, we infer the ear was medio-laterally separate from the 
mandible yet still connected antero-posteriorly to the mandible via 
the ossified Meckel’s cartilage, as in Yanoconodon, Liaoconodon, and
other gobiconodontids.

Spinolestes shows lower molariform replacements at least at m1 to 
m2 loci (Fig. 1c and Extended Data Fig. 2a). The right m1 and m3 are 
in the process of eruption, whereas beneath deciduous m2 the germ of 
replacing m2 is visible. Upper molariform replacements occur from 
M1 to M4 positions. The right M1 is in the process of eruption. 
Deciduous M3 and M4 are heavily worn at an oblique angle, down
to the roots, but still in place in the maxilla, while replacing M3 and
M4 are erupting to the lingual side of DM3 and DM4, although at different
stages (Supplementary Video 1). Medial molariform replacement/
succession occurs only in the maxilla, whereas the mandible shows vertical
replacement of deciduous by erupting molariforms. Because upper
M2 appears to have erupted earlier than M1 and M3, and lower m1
and m3 erupted before m2, Spinolestes shows alternating replacement
for these tooth loci. Alternating replacement is common in
basal mammals, and is consistent with replacement patterns of
anterior molariforms in Gobiconodon ostromi.
The shoulder girdle is therian-like (Fig. 1a, b). The broad, triangular
scapula has a high spine that ends in a cranially oriented acromion. Its 
posterior border is laterally bent, forming a secondary spine. As in 
therians, the glenoid fossa is oval, oriented nearly perpendicular to the 
scapular plate. The curved clavicle has a large, curved synovial facet on 
its distal end for mobile articulation with the acromion. The coracoid 
is small and fused to the scapula. There is no procoracoid. Five ossified 
sternal elements are present, plus the partially preserved interclavicle 
in the ventral part of the thorax. The claviculo-interclavicular joint is 
mobile, as documented for other eutriconodonts.

Spinolestes has 16 thoracic vertebrae, seven lumbar vertebrae (defined 
by zygapophyses rotated to vertical, and by transverse processes), three 
fused sacrals, and 22 caudals. Twenty dorsal vertebrae bear ribs, of which 
16 pairs are associated with the thoracic and four pairs with the lumbar 
vertebrae. Ultimate and penultimate lumbars have fused ribs. The robust 
ribs are two-headed, and have rounded cross sections. The dorsal ver- 
tebrae have reclined spinal processes. Beginning with dorsal vertebra 11, 
the spinal processes become larger in the antero-posterior dimension, 
and develop an expanded flat top. This flat top becomes larger and more 
prominent in the successively posterior lumbars. 
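
The vertebral counts in this paragraph can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes that "dorsal vertebra n" counts continuously from the first thoracic to the last lumbar vertebra, and the helper name `region_of_dorsal` is hypothetical.

```python
# Counts as stated in the description of the holotype.
vertebral_counts = {"thoracic": 16, "lumbar": 7, "sacral": 3, "caudal": 22}
rib_bearing = {"thoracic": 16, "lumbar": 4}

# Dorsal vertebrae are the thoracic and lumbar series taken together.
dorsal_total = vertebral_counts["thoracic"] + vertebral_counts["lumbar"]
rib_bearing_total = sum(rib_bearing.values())
ribless_lumbars = vertebral_counts["lumbar"] - rib_bearing["lumbar"]


def region_of_dorsal(n: int) -> str:
    """Map a 1-based dorsal vertebra number to its region.

    Assumption: numbering runs from thoracic 1 through the last lumbar.
    """
    if not 1 <= n <= dorsal_total:
        raise ValueError(f"dorsal vertebra {n} is outside 1..{dorsal_total}")
    return "thoracic" if n <= vertebral_counts["thoracic"] else "lumbar"


print(dorsal_total, rib_bearing_total, ribless_lumbars)
```

Under that numbering assumption, the flat-topped spinal processes that begin at dorsal vertebra 11 start within the thoracic series and continue into the lumbars.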

Spinolestes shows accessory lateral xenarthrous articulations (sensu
ref. 30) from dorsal vertebrae 9/10 to dorsals 16/17 (Fig. 1e, f and
Extended Data Fig. 2d). The anterolateral aspect of the prezygapophysis
forms a lateral zygapophyseal articulation with the preceding
vertebra. Among therian mammal groups, xenarthrous articulation is
most prominently developed in South American xenarthrans (armadillos,
anteaters, and sloths), and in some Eocene placentals.
Among Mesozoic mammaliaforms, a xenarthrous articulation is present
in Fruitafossor. Besides Spinolestes, this derived structure is also
present in other gobiconodontids according to our observation (also
ref. 27: figures 3-8 and 3-10). However, gobiconodontids including
Spinolestes, and the mammaliaform Fruitafossor, lack the fully
developed mammillary process (metapophysis) of the xenarthrous
articulation seen in xenarthran placentals.

Figure 2 | New gobiconodontid S. xenarthrosus,
holotype counter slab (MCCMLH30000B) and
integumentary structures. a, Holotype counter
slab with its half skeleton on original rock matrix.
b, Guard hairs of dense fur (mane) in the
cervical region (AT, atlas). c, Hairs and keratinous
dermal scutes (SC) from portion of integument
placed between dorsal vertebrae 11 and 14.
d, Arrangement of compound hair follicles (inset)
including hair bulbs (HB), primary hairs (PH), and
secondary hairs (SH). e, Area with protospines,
dermal scute (SC), and isolated tubules (inset).
f, Protospines (PS) and isolated tubules (T) under
translucent light. g, SEM image of a primary hair
shaft’s longitudinal section displaying inner
structure with cortex (C) and medulla of
discontinuous type (M). h, Oval keratinous dermal
scute (SC) dorsally of left ischium (ISC-L) with
tubules and the homogeneous matrix. i, SEM image
of compound follicle (FO) with primary (thicker)
hair of mosaic cuticular pattern and secondary
(thinner) hairs with coronary (annulus) cuticular
pattern. j, Underfur hairs with annular cuticular
pattern in the abdominal region under translucent
light. c-f, j, Taken from transferred slab
MCCMLH30000A.


Phylogenetic relationships 


Two phylogenetic analyses were conducted (Extended Data Fig. 3).
One estimates the relationships of Spinolestes among major mammaliaform
clades (Extended Data Fig. 3b, on a data set of 111 taxa and 490
characters), and the other evaluates the placement of Spinolestes
among 20 genera of eutriconodonts (Extended Data Fig. 3a, on the
basis of 90 dental and mandibular characters). Spinolestes is
nested within Eutriconodonta as a member of Gobiconodontidae.
Within Gobiconodontidae, Spinolestes is the sister-taxon of the clade
Gobiconodon + Repenomamus. The finer sampling of eutriconodont
taxa improves the resolution of the known eutriconodont phylogenies.
In fact, gobiconodontids have a broad distribution from
Asia to North America during the Early Cretaceous (Barremian to
Albian), and are relatively abundant in Asia. The discovery of
Spinolestes, plus two gobiconodontid teeth from the Early Cretaceous
of Spain and Britain, has extended the distribution of this family to
Western Europe. This is consistent with the broad interchanges of
other mammal groups on the Laurasian landmass during the Early
Cretaceous.


Integumentary and soft-tissue structures 


The Spinolestes specimen shows regional hair differentiation and exquis- 
ite preservation of many integumentary structures (Fig. 2 and Extended 
Data Figs 1 and 4-8). A dense mane of long guard hairs (3-5 mm long) 
(Fig. 2b) is developed in the parietal, cervical, and scapular regions. Long 
and fine hairs are also present along the dorsal region, where they form a 
median crest, and along most of the tail. The rest of the body is covered 
by a soft and dense underfur (Fig. 2j).

Multiple pieces of skin are preserved with microstructural details of 
hair such as primary and gradually smaller secondary hairs that are 
organized in compound follicles (Fig. 2c, d, i and Extended Data Figs
4-6), as in many modern placentals, marsupials, and monotremes.
In an isolated piece of skin between dorsal vertebrae 11 and 14, compound
follicles are associated with scale-like folds of the skin (Extended
Data Fig. 4c), identical to the pattern of hairy skin of extant dogs
(Extended Data Fig. 4d). The cuticular scales have a smooth surface.
The cuticula of primary hairs shows an irregular mosaic of imbricate, 
ovate scales (Fig. 2i and Extended Data Fig. 5a, b). The cuticula of 
secondary hairs displays coronal scales (annular, encircling the
shaft) with either simple or serrate free margins (Fig. 2i and Extended
Data Fig. 5b, c). The longitudinal section of a primary hair shaft exhibits 
a discontinuous medulla with serial chambers and septa, and a thick 
cortex made of typical fusiform, spindle-shaped cells (ref. 40: figure 3) 
(Fig. 2g and Extended Data Fig. 6e). In a few regions of the body, the 
hairs have short, truncated shafts less than 1 mm long (Fig. 2c, d) with 
dark-coloured distal ends. Compared with normal hairs, such features
match the so-called ‘block hairs’ and ‘i-hairs’ that represent a pathological
condition associated with a fungal infection (dermatophytosis)
that is widespread in extant mammals (ref. 41: figure 2.50).

Spinolestes has what we call here protospines in the dorso-sacral
region of the body (Fig. 2e, f), varying in basal diameters from 80 to
130 µm. Each of these small protospines is made up of two or more
longitudinally fused tubules that appear to have a scaly external surface,
and an interior medullary cavity surrounded by a cortical structure
(Fig. 2f). These tubules are interpreted as modified primary and secondary
hairs. Most of the thicker protospines cluster with thinner
ones and with isolated tubules, all of which are oriented randomly.
A dozen or so oval dermal scutes up to 4 mm in dimension are present
in the dorsal, lumbar, and pelvic regions (Fig. 2h and Extended Data
Fig. 8). These scutes consist of a random arrangement of tubules
merging into a homogeneous matrix.
Spines and hairs have fundamentally similar structural designs. In
embryogenesis, spines appear later than hairs and are developed from
the fusion of several hair follicles either by the merger of multiple hair
follicles of similar size, or by one large follicle merging with several smaller
surrounding hair follicles. Scanning electron microscopy (SEM)
revealed that the protospines of Spinolestes are formed by the merger 
of several hair-like tubules, which is similar to the development of 
spines from the fusion of hair follicles in extant mammals. This suggests 
that the evolution of spines was mediated in Mesozoic mammals by a 
similar process of fusion of multiple hair follicles, as in the embryogenesis
of integumentary structures in extant mammals. The unambiguous
presence of compound hair follicles and their related scale-like skin
folds in such a basal mammal clade as eutriconodonts confirms that
these features are ancestral to mammals. Spinolestes further proves that
mammalian underfur, guard hair, and spines had already fully differentiated
in some Mesozoic mammals during the Early Cretaceous. The
fact that multiple specimens of many taxa of eutriconodonts are pre- 
served with fur but lack spines makes the presence of protospines in 
Spinolestes unique among eutriconodonts, apparently evolving inde- 
pendently from monotremes (echidnas), and some placentals such as 
hedgehogs and spiny mice. Although spines of the spiny mouse Acomys 
consist of a single modified awl hair, spiny mice can serve as a modern 
analogue to Spinolestes with spiny hairs located on the lower dorsum, 
possibly for display and protection by easy loss of spines when bitten by 
predators on the back.

The specimen also shows a unique preservation of several organs. In 
the area of the scalp, the left outer ear (external pinna) is perfectly 
preserved (Fig. 2a and Extended Data Fig. 7). Within the thoracic ribcage 
of Spinolestes, a patch of fossilized soft tissues contains tubular structures 
with a branching pattern (Extended Data Fig. 9). From the position and 
distribution in the rib cage, this most probably represents fossilized lung 
tissue, and the branching structures probably represent the bronchioles
of the lung. Posterior to the lung tissue, a large oval area of reddish-brown
soft tissue (Extended Data Fig. 9a) is interpreted as residues of the
liver according to its anatomical position and colour. Liver tissue is rich
in iron, which gives it a reddish colour, as has been reported for the
theropod Scipionyx from the late Early Cretaceous (Albian) of Italy.
The boundary between the inferred lung and the liver tissues extends,
obliquely, from the distal tip of the 3rd rib to the proximal end of the
15th rib (Extended Data Fig. 9a), corresponding to the muscular diaphragm
in extant mammals. This extends from the mid-sternal area
to the posterior-most thoracic vertebrae, between the lungs in the
thorax, and the liver in the abdominal cavity. Its presence reconfirms
that this complex respiratory apparatus, which is tightly correlated with
locomotion, was already functional in Mesozoic mammals.


Skeletal and locomotor adaptations 


The scaling of long bones and mandible to body mass yields an estimated
weight for Spinolestes of 52 to 72 g (see biometrics in
Supplementary Information), indicating that it was a medium-sized
Mesozoic mammal, falling within the range of small extant didelphid
marsupials (for example, the size of Monodelphis brevicaudata and
Marmosa murina). Its limb proportions and xenarthrous vertebrae
configuration together encompass a singular ecomorphological combination.
The fibula is not co-ossified with the tibia, and is only slightly
thinner than the latter. The carpals are relatively short, and the value of
the phalangeal index of digit III (equal to 121), corresponding to the top
quartile of mammals with a terrestrial locomotory mode, definitively
indicates that it was not arboreal. The terminal phalanges lack a highly
curved dorsal outline (Fig. 1d), but are robust and relatively wide, both of 
which are features common for extant mammals of terrestrial and fos- 
sorial lifestyles. Many extant placentals with xenarthrous vertebral 
articulations (or with reinforced thoracolumbar vertebrae in general) 
are capable diggers, such as armadillos and anteaters. The Jurassic mam- 
maliaform Fruitafossor exhibits striking convergent adaptations to 
armadillos such as reduced enamel-less peglike teeth and xenarthrous 
vertebral articulations. The Eocene mammal Eurotamandua (putatively
a palaeanodont) has edentulous jaws and strongly enlarged manual
claws. Both taxa are considered as fossorial and feeding on colonial
insects. In contrast to Fruitafossor, Eurotamandua, and xenarthrans,
the dentition of Spinolestes does not exhibit any tendency to reduction.
Thus, Spinolestes shows a mosaic of functional features, suggesting a
terrestrial locomotion, with ambulatory gait, as in Gobiconodon, and
potential digging abilities, with a more versatile lifestyle (Extended 
Data Fig. 10). The extant armoured shrew Scutisorex can serve as a 
modern analogue for the lifestyle of Spinolestes. Scutisorex is unique 
among mammals by its massive interlocking lumbar vertebrae, which 
provide an extraordinary vertebral strength. It has been hypothesized 
that Scutisorex uses this vertebral strength to force open the base of 
palm leaves to search for insects or larvae in swampy forests. A
similar lifestyle, supported by the strong arms and hands, is hypothe- 
sized here for Spinolestes, which lived in the vegetated Las Hoyas 
wetland environment. 
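
The phalangeal index quoted in this section for digit III can be illustrated with a short calculation. The sketch below assumes the commonly used definition (summed lengths of the proximal and intermediate phalanges divided by the metacarpal length, multiplied by 100), which may differ in detail from the cited source. The function name and the bone lengths are hypothetical, chosen only so that the result matches the reported value of 121; they are not measurements of the holotype.

```python
def phalangeal_index(proximal_mm: float, intermediate_mm: float,
                     metacarpal_mm: float) -> float:
    """Return 100 * (proximal + intermediate phalanx length) / metacarpal length."""
    if metacarpal_mm <= 0:
        raise ValueError("metacarpal length must be positive")
    return 100.0 * (proximal_mm + intermediate_mm) / metacarpal_mm


# Hypothetical lengths (mm) for digit III; NOT measurements of the holotype.
index_digit_iii = phalangeal_index(proximal_mm=3.55, intermediate_mm=2.50,
                                   metacarpal_mm=5.00)
print(round(index_digit_iii, 1))
```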


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Upon discovery, the rock containing the specimen was split into two slabs, each 
containing original bone, fossilized soft tissues, and natural moulds of bones. One 
slab (MCCMLH30000A) was transferred to a matrix of epoxy resin to expose 
undisturbed bone surface hidden in the rock and to obtain casts of the natural 
moulds of bone impressions. After embedding in epoxy resin, the limestone 
matrix was removed by a formic acid bath that dissolved the calcium carbonate 
but not the hydroxyapatite of bone and the phosphatized soft tissue. The trans- 
parent artificial plastic matrix allows examination of integument structures under 
translucent light. The other slab (MCCMLH30000B) was left untouched in the 
limestone matrix to keep its original embedding situation. 

The transferred slab and the non-transferred slab were scanned on a v|tome|x s
µCT scanner at 240 kV (GE Electronics Phoenix|X-ray), 864 slices at a voxel size
of 107.73 µm. Resolution was electronically enhanced (voxel size 53.86 µm) by
Datos|x-reconstruction software (version 1.5.0, GE Sensing Inspection
Technologies). Reconstructions (Supplementary Videos 1 and 2) were made with Avizo
(version 8.0, Visualization Science Group).
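
As a quick check of the scan geometry, the sketch below converts the stated slice count and voxel sizes into a stack thickness and a resampling factor. It assumes isotropic voxels and contiguous slices along one axis, neither of which is stated explicitly; the variable names are illustrative.

```python
slices = 864
voxel_um = 107.73           # native voxel edge length, micrometres
enhanced_voxel_um = 53.86   # voxel edge length after software enhancement

# Thickness of the scanned stack along the slice axis, in millimetres.
stack_extent_mm = slices * voxel_um / 1000.0

# Linear resampling factor and the resulting number of enhanced voxels
# that occupy the volume of one native voxel.
linear_factor = voxel_um / enhanced_voxel_um
voxels_per_native_voxel = round(linear_factor) ** 3

print(f"{stack_extent_mm:.2f} mm, x{linear_factor:.2f} linear")
```

The enhanced voxel size is almost exactly half the native one, so the enhancement amounts to a twofold interpolation along each axis rather than a gain in physically resolved detail.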

Two phylogenetic analyses were performed using PAUP (4.0b10) to estimate 
the placement of Spinolestes among eutriconodont mammals and their imme- 
diate outgroups, and among major cynodont-mammal clades. The analysis of 
eutriconodonts was based on a data set augmented from refs 19, 20, 32, 33. The
matrix has 29 taxa (20 eutriconodonts plus 9 outgroups) and 87 parsimony-informative
characters. In this matrix, all characters have equal weight, six multi-state
characters are ordered, and the remaining characters are unordered. The
branch-and-bound search, which guarantees finding the shortest tree, yielded a single most
parsimonious tree (tree length = 249; consistency index = 0.446; retention
index = 0.755), as presented in Extended Data Fig. 3a. Nodal strengths of key
clades are also assessed by bootstrap values (on the basis of a 50% majority-rule
consensus from 1,000 bootstrap replicates), and by Bremer index.
Details are given in the Supplementary Information.
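
For readers unfamiliar with the tree statistics reported above, the sketch below shows the standard ensemble definitions: the consistency index is the minimum possible number of steps divided by the observed tree length, and the retention index is (G - S) / (G - M), where S is the observed length, M the minimum and G the maximum possible number of steps. Only the tree length of 249 comes from the text. The values of M and G are hypothetical, chosen so that the results round to the reported indices; they are not taken from the authors' matrix.

```python
def consistency_index(min_steps: int, tree_length: int) -> float:
    """Ensemble consistency index: minimum possible steps / observed steps."""
    return min_steps / tree_length


def retention_index(min_steps: int, max_steps: int, tree_length: int) -> float:
    """Ensemble retention index: (G - S) / (G - M)."""
    return (max_steps - tree_length) / (max_steps - min_steps)


tree_length = 249   # reported length of the most parsimonious tree
min_steps = 111     # hypothetical M, for illustration only
max_steps = 674     # hypothetical G, for illustration only

ci = consistency_index(min_steps, tree_length)
ri = retention_index(min_steps, max_steps, tree_length)
print(round(ci, 3), round(ri, 3))
```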

Phylogenetic analysis of Spinolestes among major clades of cynodonts and 
mammals was based on a data set expanded and modified from that of ref. 7. 
The data matrix has 112 taxa (including 8 eutriconodonts) and 490 characters (all
parsimony-informative). The phylogenies were estimated by 1,000 replicates of 
heuristic search with a tree bisection and reconnection branch-swapping algo- 
rithm. All multi-state characters are unordered; all characters have equal weight. 
The search yielded 172 equally parsimonious trees (each tree has a length of 2,386 
steps). A strict consensus tree of the 183 equally parsimonious trees is presented 
in Supplementary Information and a simplified tree is presented in Extended 
Data Fig. 3b. 
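
The two kinds of consensus mentioned in these Methods (strict, and 50% majority rule for the bootstrap) can be sketched as set operations on clades. This is a minimal illustration and not the PAUP implementation: each tree is represented simply as the set of its non-trivial clades, and the three toy topologies are hypothetical, using taxon names from the text only as labels.

```python
from collections import Counter


def clade_frequencies(trees):
    """Fraction of trees in which each clade (a frozenset of taxa) occurs."""
    counts = Counter(clade for tree in trees for clade in tree)
    return {clade: n / len(trees) for clade, n in counts.items()}


def strict_consensus(trees):
    """Clades present in every input tree."""
    return {c for c, f in clade_frequencies(trees).items() if f == 1.0}


def majority_rule_consensus(trees):
    """Clades present in more than half of the input trees."""
    return {c for c, f in clade_frequencies(trees).items() if f > 0.5}


# Hypothetical toy topologies, each written as a set of clades.
gob_rep = frozenset({"Gobiconodon", "Repenomamus"})
rep_spi = frozenset({"Repenomamus", "Spinolestes"})
gobiconodontids = frozenset({"Gobiconodon", "Repenomamus", "Spinolestes"})
jeholodentids = frozenset({"Jeholodens", "Yanoconodon"})

trees = [
    {gob_rep, gobiconodontids, jeholodentids},
    {gob_rep, gobiconodontids, jeholodentids},
    {rep_spi, gobiconodontids, jeholodentids},
]

strict = strict_consensus(trees)
majority = majority_rule_consensus(trees)
print(len(strict), len(majority))
```

In this toy case the strict consensus keeps only the two clades found in all three trees, whereas the majority-rule consensus also retains the clade found in two of three.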
Extended Data Figure 1 | New gobiconodontid S. xenarthrosus, holotype
transferred slab MCCMLH30000A. Skull exposed in ventral aspect. Inset: left
calcaneus in dorsal view on the same slab somewhat apart from the skeleton.
Arrow points to skin patch preserving hair bundles between dorsal vertebrae 11
and 14.
Extended Data Figure 2 | Dentition and xenarthrous vertebrae of
Spinolestes. a, SEM images (stereo pair) of right lower mandible and right
upper maxilla with molariforms. b, SEM image of anterior portion of skull and
mandibles with dentition. c, SEM image of penultimate (m3) and ultimate (m4)
right lower molariforms in lingual view. Abbreviations: C/c, upper/lower
canine; I/i, incisor; M/m, molariform; P/p, premolar; D/d, deciduous tooth; R/r,
replacing tooth. d, SEM images (stereo pair) of dorsal vertebrae D 12 to D 19 in
lateral aspect with ribs.
Extended Data Figure 3 | Phylogenetic relationships of Spinolestes and
patterns of mammalian integumentary structure in early mammalian
evolution. a, Position of Spinolestes within Eutriconodonta (Bremer index/
bootstrap values on the key clades of this study). b, Evolution of mammalian
integumentary structures (simplified tree). Data sets and full analyses are
presented in Supplementary Information.
Extended Data Figure 4 | Skin structures of S. xenarthrosus. a, SEM image
of skin surface showing compound hair follicles (FO), epidermal cells
(keratinocytes), and pores (P). b, Schematic drawings of scale-like skin folds
(SCF) with compound hair follicles (FO) of the dog with primary (PH) and
secondary (SH) hairs of three ontogenetic stages (redrawn from ref. 38). c, Skin
surface of Spinolestes with scale-like wrinkles. d, Schematic diagram of
scale-like wrinkled skin folds with hair follicles of a dog (redrawn from ref. 38).
e, SEM image of epidermal cells (keratinocytes) of a hair follicle. f, SEM images
of skin surface with polygonal epidermal cells (keratinocytes) and pores.
g, Detail of the keratinocytes.
Extended Data Figure 5 | SEM images and interpretative line drawings of cuticular scale patterns. a-c, Primary (a, b) and secondary (b, c) hairs.
Extended Data Figure 6 | Hair structures of S. xenarthrosus. a, c, d, Patch
of skin located between dorsal vertebrae 11 and 14 (arrow in Fig. 1 and
Extended Data Fig. 1) on slab MCCMLH30000A under translucent light.
b, SEM image of an orifice of a compound follicle (FO) with primary hair (PH)
and three broken secondary hairs (SH). e, SEM image of a fraying hair of S.
xenarthrosus showing fusiform cortical cells (FCC). f, Schematic diagram of
human hair bulb (B) with cuticular scales (CU) and fusi (F) of the hair shaft
(HS). g, Schematic diagram of a human head hair with fusiform cortical cells
(FCC) and fusi. f, g, Redrawn from ref. 40. Abbreviations: C, cuticula; HB, hair
bulb.
Extended Data Figure 7 | External pinna of S. xenarthrosus.
a, b, Comparison with pinna and scalp of decaying rat (Rattus norvegicus): a, on
transferred slab MCCMLH30000A; b, on counter slab MCCMLH30000B on
original limestone rock matrix. c, d, Pinna and scalp of decaying rat (Rattus
norvegicus) for comparison. c, Head of decaying rat after 274 days in water at
room temperature. The skull is detached from the scalp and has fallen to the
bottom. Scalp and right pinna are still intact and connected to the torso of the
floating carcass. d, Detail of c with well-preserved right pinna in original
position at the scalp. The mandible is displaced and loosely hanging down from
the skin.
Extended Data Figure 8 | Keratinous dermal scutes (SC) of S. xenarthrosus. a, Oval dermal scutes dorsally of dorsal vertebra 20. b, Oval dermal scute dorsally of
left ischium (ISC-L). c, Oval dermal scute from dorsal region with tubules (T) merging to homogeneous keratinous matrix.
Extended Data Figure 9 | Visceral cavities of S. xenarthrosus with internal
organs. a, Detail of the visceral cavities of MCCMLH30000B with separation of
the anterior thoracic cavity (TC) containing the lungs (LU), and the posterior
abdominal cavity (AC) containing the liver (LI). Dashed line represents the
diaphragm. SE, sternal elements. b, Lung tissue in the anterior part of the
thoracic cavity. c, Peripheral conducting and acinar airways of the lungs with
typical dichotomous structure of the bronchial tree. d, Peripheral conducting
and acinar airways of the lungs from transferred slab MCCMLH30000A
under translucent light. e, f, Details of the peripheral conducting airway
dichotomies (DI) and acinar structures (ACI) of the lungs.


©2015 Macmillan Publishers Limited. All rights reserved 




Extended Data Figure 10 | Lifelike reconstruction of S. xenarthrosus. 
Glia-derived neurons are required for
sex-specific learning in C. elegans 


Michele Sammut, Steven J. Cook, Ken C. Q. Nguyen, Terry Felton, David H. Hall, Scott W. Emmons,
Richard J. Poole & Arantza Barrios


Sex differences in behaviour extend to cognitive-like processes such as learning, but the underlying dimorphisms in 
neural circuit development and organization that generate these behavioural differences are largely unknown. Here we 
define at the single-cell level—from development, through neural circuit connectivity, to function—the neural basis of a 
sex-specific learning in the nematode Caenorhabditis elegans. We show that sexual conditioning, a form of associative 
learning, requires a pair of male-specific interneurons whose progenitors are fully differentiated glia. These neurons are 
generated during sexual maturation and incorporated into pre-existing sex-shared circuits to couple chemotactic
responses to reproductive priorities. Our findings reveal a general role for glia as neural progenitors across metazoan 
taxa and demonstrate that the addition of sex-specific neuron types to brain circuits during sexual maturation is an 
important mechanism for the generation of sexually dimorphic plasticity in learning. 


During sexual maturation, the nervous system undergoes sexually 
dimorphic changes that couple behaviour to reproductive needs. 
Sex differences in behaviour extend beyond courtship and mating to 
cognitive-like processes such as learning that also enhance reproductive 
success. However, the precise dimorphisms in neural circuit 
development and organization that generate these differences in 
behavioural plasticity are mainly unknown. 

C. elegans is a sexually dimorphic species with hermaphrodites that 
can self-fertilize and males that can only cross-fertilize hermaphro- 
dites. As a consequence, males display a repertoire of reproductive 
behaviours not performed by the hermaphrodite and these beha- 
viours require sex differences in the nervous system that develop 
during sexual maturation. The best described dimorphisms are 
the male-specific sensory and motor circuits required for mating. 
However, non-reproductive behaviours such as learning are also 
sexually dimorphic. Males, but not hermaphrodites, are capable of 
sexual conditioning, a process that involves associative learning and 
leads to a switch in chemotactic responses that facilitates effective 
mate finding. Sexual conditioning requires sex-shared and male-specific 
sensory neurons, but whether the circuit also contains sexually 
dimorphic interneurons for integration is not known. 


The MCMs are a novel class of interneurons 


We identified a previously unnoticed bilateral pair of neurons in the 
head of males that are required for sexual conditioning. We have 
termed them the MCMs (for mystery cells of the male) (Fig. 1). The 
identification of the MCMs is surprising given that the C. elegans 
anatomy, neuronal connectivity and developmental lineage have been 
characterized extensively. We noted the presence of the MCMs 
when we were analysing the expression pattern of a reporter transgene 
for the neuropeptide pdf-1. In males, but not in hermaphrodites, we 
observed pdf-1 expression in a bilateral pair of cells located dorso- 
anterior to the pharyngeal metacorpus, a region devoid of neuronal 
cell bodies in hermaphrodites (Fig. 1a-c). This region is composed of 
glial and epithelial cells, some of which project anteriorly to the 
nose. We observed that, unlike any other cells in this region, the 
male’s pdf-1-positive cells send projections posteriorly into the nerve 
ring (Fig. 1b and Extended Data Fig. 1a, b). The nerve ring is the main 
concentrated neuropil of the nematode brain, where most sensory 
neuron-interneuron synapses and neural processing occur. We 
found that the pdf-1-positive cells also express the pan-neuronal 
reporter rab-3, suggesting they may be neurons. rab-3 expression 
was first observed at the L4 stage, when sexual maturation begins 
(Fig. 1c). Through reporter gene analysis and ultrastructural 
reconstruction, we found that the MCMs are indeed fully differentiated 
neurons. In addition to rab-3, these cells express a battery of neuronal 
genes required for both electrical and chemical communication 
(Fig. 1d, Extended Data Fig. 1c and Extended Data Table 1). These 
include components of the SNARE complex; Na⁺ and voltage-gated 
Ca²⁺ channel subunits; innexins; components of the machinery for 
neuropeptide secretion ric-19 and ida-1; and pdf-1. At the ultrastructural 
level, we observed dense-core vesicles in the MCM somata and 
projections, and clear-core vesicles at presynaptic densities (Fig. 1e). 
Together, these observations indicate that the MCMs are a novel class 
of male-specific interneurons. 

We identified the MCMs in electron micrographs of serial sections 
through the male head and established their entire neural connectivity 
(Fig. 2 and Extended Data Table 2). The cell bodies send processes 
posteriorly in the amphid bundles, from which they diverge to enter 
the nerve ring, and from which they exit and extend into the ventral 
nerve cord (Extended Data Fig. 1b). In the nerve ring and ventral 
nerve cord we identified synaptic interactions with 24 neuron classes 
(Extended Data Fig. 1d and Extended Data Table 2). The bulk of 
the synaptic input (65%) is from three interneuron classes, two of 
which are sex-shared (AVF and PVQ) and one of which is male- 
specific (EF). These interneurons receive extensive sensory inputs 
from the male copulatory circuits in the tail and extend processes 
through the ventral nerve cord and into the nerve ring, where they 
connect to the MCMs both directly and through RIF (a sex-shared 
class of second-order interneurons that receive input from head 
chemosensory circuits) (Fig. 2). The largest synaptic output is onto 
the AVB pre-motor interneurons that drive forward locomotion. 
These outputs are both direct (41%) and indirect via RIF (14%). 
Thus, the presence of the MCMs creates a series of male-specific 
disynaptic feed-forward triplet motifs connecting male-specific EF 
and sex-shared AVF, PVQ and RIF to AVB (Fig. 2). Such triplet 
motifs recur frequently in the C. elegans nervous system. There is 
also a small amount of reciprocal interaction between the MCMs and 
the male-specific, pheromone-sensing CEM head sensory neurons. 
The major input from mate-sensing circuits and their output onto 
second- and third-order interneurons make the MCMs ideally placed 
for the integration of mate-experience into circuits regulating 
behavioural plasticity to sensory stimuli. 

Figure 1 | The MCMs are newly identified male-specific neurons. 
a-d, Lateral views (a, b) and dorsal views (c, d) of animals oriented anterior 
to the left. a, WormAtlas-style diagram depicting the morphology and 
position, adjacent to the pharynx, of one of the bilateral pair of MCM neurons 
in the head of a male. b, Confocal projection of pdf-1::rfp expression in the head 
of an adult male (same region as in a). c, Expression of the pan-neuronal 
reporter transgene rab-3::rfp (Ras GTPase) in the head of an hermaphrodite 
and males at the third (L3) and fourth (L4) larval stages. The position of the 
MCMs is indicated with dashed red circles. L, left; R, right. d, Expression in 
the MCMs in adult males of reporter transgenes for neuronal markers. 
nca-1 (NALCN Na⁺ channel subunit); inx-3 (gap junction innexin); snt-1 
(synaptotagmin); unc-64 (syntaxin); snb-1 (synaptobrevin); rgef-1 (Ras 
exchange factor); ida-1 (tyrosine phosphatase-like receptor, phogrin); ric-19 
(cytosolic, vesicle secretion). e, Electron micrographs of MCM showing 
dense-core vesicles (DCVs) (left) and a synapse (right). 

Figure 2 | MCM connectivity. a, Connectivity diagram of the MCMs showing 
the main inputs and outputs. The connections to the neurons known to regulate 
sexual conditioning (rays and CEMs), chemosensory plasticity (AIA, RIF), 
and salt sensation (ASE) are included. Grey and red connections indicate 
chemical and electrical synapses, respectively. The triplet motifs created by the 
MCMs are highlighted in black. The thickness of the lines is proportional to 
the anatomical strength of the connections, indicated in numbers (the number 
of EM serial sections of scored synaptic connectivity). Pink triangles, sensory 
neurons; red octagons, interneurons; purple circles, motor neurons; blue 
outline, male-specific. 
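
The feed-forward triplet motif described in the text (a source neuron that contacts AVB both directly and through the MCMs) can be illustrated with a small graph search. The Python sketch below is not the authors' analysis. The edge list is a simplified, unweighted reading of the connections named in the text, the direct edges from EF, AVF, PVQ and RIF onto AVB are assumed from the description of the motif, and the function name `feed_forward_sources` is hypothetical. The synapse counts of Extended Data Table 2 are not reproduced.

```python
# Simplified, unweighted edge list (presynaptic, postsynaptic) read from the text.
# Illustrative only: this is not the reconstructed connectome.
EDGES = [
    ("EF", "MCM"), ("AVF", "MCM"), ("PVQ", "MCM"),   # main inputs to the MCMs
    ("EF", "RIF"), ("AVF", "RIF"), ("PVQ", "RIF"),   # inputs routed through RIF
    ("RIF", "MCM"),
    ("MCM", "AVB"), ("MCM", "RIF"), ("RIF", "AVB"),  # direct and indirect outputs
    ("EF", "AVB"), ("AVF", "AVB"), ("PVQ", "AVB"),   # assumed edges closing each triplet
]


def feed_forward_sources(edges, mid, target):
    """Return sorted sources X for which X->mid, mid->target and X->target all exist."""
    edge_set = set(edges)
    if (mid, target) not in edge_set:
        return []
    return sorted(
        pre
        for pre, post in edge_set
        if post == mid and pre != target and (pre, target) in edge_set
    )


if __name__ == "__main__":
    print(feed_forward_sources(EDGES, "MCM", "AVB"))
```

With this toy edge list the search returns the four source classes named in the text for the motifs that pass through the MCMs onto AVB.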


The MCMs regulate sex-specific learning 


We found that the MCMs are required for a male-specific switch in 
chemosensory behaviour induced by sexual conditioning. Sexual con- 
ditioning is a mate-experience-dependent process of behavioural 
modification that overrides the effects of starvation during chemosensory 
learning. This plasticity confers on males the ability to use 
recent environmental chemosensory cues as signals for effective mate 
finding, and reflects the adult male prioritization of sex over food. 
One chemosensory behaviour that is subject to sexual conditioning is 
salt-avoidance learning. C. elegans is normally attracted to salt but 
both males and hermaphrodites can learn to avoid salt when it is 
previously associated with an aversive stimulus such as starvation 
(Fig. 3a). However, unlike hermaphrodites, males switch their behaviour 
and become attracted to salt if mates are present during previous 
conditioning (Fig. 3a). Ablation of the MCMs at the late fourth larval 
(L4) stage with a laser microbeam resulted in males that failed to 
undergo a sexually conditioned switch, thus avoiding salt rather than 
being attracted to it, after conditioning with starvation, salt and mates 
(Fig. 3b). Importantly, MCM ablation did not disrupt salt attraction in 
non-conditioned males or salt avoidance learning after conditioning 
with salt and starvation in the absence of mates (Fig. 3b), indicating 
that the defects are specific to sexual conditioning and not due to 
inability to sense salt or starvation. 

Sexual conditioning requires two types of mate inputs: a secreted 
pheromone synthesized by the enzyme DAF-22 and sensed through 
chemosensory amphid neurons and the male-specific CEM sensory 
neurons in the head, and a contact-dependent cue that is sensed 
through the male-specific ray neurons in the tail. MCM-ablated 
males responded efficiently to both daf-22-dependent secreted 
pheromones (ascarosides) (Fig. 4a, b and Extended Data Fig. 2a) and 
contact-dependent hermaphrodite cues that initiate the mating sequence 
and inhibit male food-leaving behaviour (Fig. 4c, d and Extended 
Data Fig. 2b, f). These results indicate that the MCMs are not required 
for the sensory detection of mates, salt or starvation, but that they 
specifically regulate the integration of these cues to produce 
behavioural plasticity in a context- and sex-specific manner. 

Figure 3 | The MCMs are required for male-specific associative learning. 
a, Diagram depicts the salt-avoidance learning and sexual conditioning assay. 
CS, chemotaxis score. b, Salt chemotaxis scores of previously conditioned intact 
and MCM-ablated males [inIs179(ida-1::gfp);him-8(e1489)], and wild-type 
[him-5(e1490)] and pdf-1(tm1996) mutant males. n, number of individual 
animals tested. Error bars indicate s.e.m. Mann-Whitney U-test was used 
for statistical analysis. ***P < 0.001; **P < 0.01; *P < 0.05; NS, no 
statistically significant difference (P ≥ 0.05). WT, wild type. 

Having found a role for the MCMs in sexual conditioning, we next 
asked whether the MCMs are important regulators of other male- 
specific behaviours from the coordinated sensory-motor programmes 
of mating to other integrative behaviours such as the decision to 
leave food in search of mates. MCM ablation did not cause 
defects in any other male-specific behaviour tested or overall fertility 
(Fig. 4c, d and Extended Data Fig. 2b-f), demonstrating a surprisingly 
specific role for the MCMs in sexual conditioning. 

To establish whether sexual conditioning is regulated through 
neuromodulation, we tested pdf-1(tm1996) mutants in the salt chemotaxis 
learning assay. The tm1996 deletion allele removes the minimal 
promoter and first exon of the pdf-1 gene and is likely to be a null. 
We found that pdf-1 mutant males, like MCM-ablated males, failed to 
undergo sexual conditioning and avoided salt after conditioning with 
starvation, salt and mates (Fig. 3b). Loss of pdf-1, like loss of MCMs, 
did not disrupt salt sensation in naive animals or the ability to associate 
salt and starvation (Fig. 3b). pdf-1 mutants also responded to secreted 
and contact-dependent mate cues, although they were slightly, but 
significantly, less efficient than wild-type males at contact response 
(Fig. 4a, b and Extended Data Fig. 2b). Together, these results tentatively 
suggest that the MCMs may modulate the circuits that regulate 
chemosensory plasticity through neuropeptide secretion. 


The MCMs arise from glia 


Next, we sought to establish the developmental mechanisms by which 
these male-specific neurons arise. Unlike any other neurons in 
C. elegans and other invertebrates, which arise from epithelial or 
undifferentiated blast cells, we found that the MCMs arise from 
glial cells. Glia are specialized cells with projections that ensheath the 
cilia of the sensory neurons with which they are associated, and 
provide structural and functional support to these neurons. As neuronal 
gene expression in the MCMs begins during sexual maturation, we 
examined the expression of an rnr-1::gfp reporter, which labels cells in 
S-phase, to determine whether the MCMs are born at this stage via 
an undescribed cell division in the male head. In the head region, we 
observed strong expression only in males (Fig. 5a). Expression was 
observed in a bilateral pair of cells at the late L3 stage, corresponding 
to the amphid socket (AMso) glial cells, and perduring in two bilateral 
pairs of cells at the early L4 stage, corresponding to the two AMso and 
the MCMs (Fig. 5b and Extended Data Fig. 3a). The lack of expression 
in the hermaphrodite head is consistent with the lack of AMso glial 
cell division in this sex. 

control buffer is indicated by +++, P < 0.001; 
+, P < 0.05; -, no statistically significant difference. 
n.d., not determined. c, d, Execution of 
mating sub-steps (c) and exploration in search of 
mates (d). +++ indicates performance level 
of intact control males. 

Ablation of one of the bilateral pair of AMso glial cells at the L3 
stage in males resulted in a consistent loss of the MCM on the oper- 
ated side (Extended Data Table 3). Importantly, the MCM was not lost 
when the AMso was ablated at the mid-L4 stage, after the MCM was 
born (Extended Data Table 3). Together this demonstrates that the 
AMso glial cells are the MCM progenitors. 

The AMso glial cells display a polarized morphology with a single 
projection running anteriorly along the anterior to posterior (A-P) 
axis to the nose (Figs 5d and 6a, b). Live imaging of the AMso 
cell division in males revealed that the cleavage plane was 
perpendicular to the axis of AMso polarity and that the AMso projection 
was maintained throughout the division (Fig. 5c and Extended 
Data Fig. 3b). The projection was inherited by the self-renewing 
AMso glial daughter. 

Figure 5 | The MCMs originate from a male-specific cell division of the 
AMso glial cells. Lateral views of animals oriented anterior to the left and 
dorsal to the top. a, b, Expression of the S phase reporter transgene rnr-1::gfp in 
whole animals at the L4 stage (a) and in the head of L3 and L4 males (b). 
The male-specific division in the head is indicated by dashed circles in a. The 
rest of expression corresponds to developing reproductive structures. 
c, Differential interference contrast (DIC) and fluorescent images of the AMso 
cell body with RFP-labelled histones during cell division. d, WormAtlas 
diagram depicting the morphology and position of the AMso and the MCM in 
the male head at the L3 and L4 stages. e, Head of animals with sex-reversal 
genetic manipulations of AMso. Feminization by expression of the 
grl-2::tra-2(IC)::SL2::mCherry transgene oleEx23. Masculinization by expression of 
grl-2::fem-3::SL2::mCherry transgenes oleEx18 and oleEx24. 


AMso plasticity is intrinsically regulated 

The AMso glial cells are born during embryogenesis from the same 
fully described lineages in both sexes (Extended Data Fig. 4a), but 
retain their plasticity only in males, re-entering the cell cycle during 
sexual maturation. We found that the AMso developmental program 
occurred according to the genetic sex of the AMso cells and not 
according to that of the rest of their lineage or the animal. Sexual 
dimorphism in C. elegans is regulated by a genetic pathway that 
converges on the Ci/GLI-like zinc-finger transcription factor tra-1, which 
inhibits male development and activates hermaphrodite development. 
Consistent with a role for the sex-determination pathway 
in MCM specification, 22/22 tra-1(e1488) sex-transformed XX 
pseudo-males produced MCMs (data not shown). The sister cells 
of the AMso are the male-specific CEM sensory neurons, which 
undergo cell death during embryogenesis in hermaphrodites 
(Extended Data Fig. 4a). In order to establish whether sex-specific 
neurogenic competence of the AMso glial cells is specified extrinsically 
or intrinsically, we expressed sex-reversing transgenes under 
cell-restricted promoters to uncouple the genetic sex of the AMso glial cells 
from that of the rest of the animal. We used the grl-2 and ztf-16 
promoters, which drive expression in the AMso cells after they are 
born and one or two other non-overlapping cell types in the head in 
larval animals. Masculinization of the AMso cells in hermaphrodites, 
by expression of a fem-3 transgene, resulted in AMso cell 
division and MCM neuronal differentiation (Fig. 5e, Extended Data 
Fig. 4b and Extended Data Table 4). Conversely, feminization of 
AMso cells in males, by overexpression of the tra-2 intracellular 
domain (tra-2IC), resulted in lack of AMso cell division and lack 
of MCMs (Fig. 5e, Extended Data Fig. 4c and Extended Data Table 4). 
Masculinizing transgenes in males or feminizing transgenes in 
hermaphrodites had no effect on AMso development (Fig. 5e). These results 
indicate that the competence of the AMso glial cell to become a neural 
progenitor is specified intrinsically, and extend the battery of sexual 
dimorphisms regulated by the sex-determination pathway to the 
intrinsic properties of glia. 

Figure 6 | The male AMso cells are fully differentiated glia before and after 
the division that generates the MCM neuron. Lateral views of animals 
oriented anterior to the left and dorsal to the top. a, AMso glial cell (green, 
grl-2::gfp) and amphid sensory neuron dendrites (red, pdf-1::rfp) in an L3 
hermaphrodite and L3 and L4 males. b, Diagram of the AMso glial cell and the 
distal end of its projection to the nose, where it ensheaths the cilia of sensory 
dendrites. c, Expression in AMso and perdurance in MCM of reporter 
transgenes for glial markers in L3, L4 and adult males. ptr-10 (Patch-related 
receptor); itr-1 (IP3 receptor); grl-2 (Hedgehog-like/Ground-related). 


The male AMso cells are fully differentiated glia 


We found that the male AMso glial cells are fully differentiated 
specialized cells before they divide, indicating that the production 
of the MCMs from AMso involves a complete switch in terminal cell 
fate (Fig. 6). In hermaphrodites and adult males, the AMso cells are 
specialized glial cells that form a hole in the cuticle through which 
most amphid neurons contact the outside world (Fig. 6a, b). 
Hermaphrodite AMso cells also express a battery of glial-cell markers. 
We observed that at the L3 stage, before their division, the male 
AMso cells display the same characteristics as the AMso of hermaph- 
rodites and adult males: they project to the nose where they form a 
socket that ensheaths amphid neuron cilia (Fig. 6a, b), and they 
express AMso markers (Fig. 6c and Extended Data Table 5). 
Following AMso division, one daughter retains the AMso identity 
and the other becomes the MCM, which loses molecular and struc- 
tural characteristics of glia (Extended Data Fig. 5) and, as described 
above, acquires a neuronal identity. Similar complete switches in 
terminal cell fates have been described as transdifferentiation events 
in C. elegans and other systems. 


Discussion 


The production of the MCMs from AMso glial cells through a division 
that results in self-renewal and neuronal differentiation is equivalent 
to a snapshot of the reiterative neuronal differentiation events arising 
from radial glia cell divisions during vertebrate neurogenesis. 
However, the extent to which vertebrate neural progenitors are fully 
differentiated glia remains unclear and, in particular, it is not known 
whether the support and progenitor functions of glia can be integrated 
within one cell. Our results provide the first example of neurons 
arising from glia in a non-vertebrate organism and demonstrate that 
fully differentiated, functional glia can retain neural progenitor prop- 
erties during normal development. 

In C. elegans the best described transdifferentiation event is the 
direct conversion of the rectal epithelial cell Y into a neuron. 
Other cell fate switches in C. elegans and other systems, which are 
not direct but require cell division, have also been suggested to be 
transdifferentiation events. The complete cell fate switch we 
observe suggests that the production of neurons from glia may be a 
process of natural transdifferentiation. It will be interesting to deter- 
mine whether direct and indirect transdifferentiations of this kind 
occur through similar molecular mechanisms. 

Our findings provide a direct link between developmental and 
anatomical sexual dimorphism in higher-order processing areas of 
the brain and sexually dimorphic behavioural plasticity during learn- 
ing. We have shown that sex differences in learning require the pro- 
duction, late in development, of a male-specific class of cephalic 
interneurons that were not previously known to exist. These interneurons 
are incorporated into pre-existing, sex-shared circuits allowing 
the male to change behavioural priorities according to its new 
reproductive needs. The MCMs function to confer salience to pre- 
vious mate-experience during male navigation. Based on their con- 
nectivity, we suggest that these interneurons may function in other 
chemosensory-plasticity behaviours that may be subject to sexual 
conditioning. The production of more neurons in males is also a 
mechanism to generate cognitive sexual dimorphism in the song- 
learning system of songbirds, where neural progenitors have a glial 
identity. However, whether these neurons are a male-specific class 
and whether their progenitors are intrinsically sexually dimorphic is 
not known. Our findings indicate that the addition of sex-specific 
neurons is an effective way of remodelling brain circuits during sexual 
maturation and reveal a general, possibly ancient role for glia as neural 
progenitors to assemble circuits for higher-order processing. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


No statistical methods were used to predetermine sample size. No samples were 
excluded from analysis. For behavioural assays, investigators were blinded to the 
genotype or surgery of the animals during experiments and outcome assessment. 
Experiments testing response to secreted pheromones were randomized. 
Strains. him-5(e1490) was used as the wild-type background strain. BL5717, 
inIs179(ida-1::gfp);him-8(e1489) was used for MCM ablation and behavioural 
assays (data in Figs 3 and 4 and Extended Data Fig. 2). BAR4, 
otIs356(rab-3::rfp)him-5(e1490) was used for MCM ablation and behavioural assays (data 
in Fig. 4 and Extended Data Fig. 2). PT2248, pdf-1(tm1996);him-5(e1490). DR476, 
daf-22(m130). CB369, unc-51(e369). VT774, unc-36(e251);maIs103(rnr-1::gfp + 
unc-36(+)). CB2823, tra-1(e1488)III;eDp6(III;f). BAR63, 
stIs10116[his-72::his-24::mCherry::let-858 3' UTR + unc-119(+)];itIs37[pie-1::mCherry::H2B::pie-1 3' 
UTR + unc-119(+)];him-5(e1490). 

Arrays for neuronal and amphid socket markers are indicated in Extended 
Data Tables 1 and 5, respectively. 
DNA constructs. Sex transformation constructs were made by PCR fusion of the 
regulatory sequences of grl-2 (ref. 34) or ztf-16 (ref. 33) directly upstream of the 
ATG of fem-3::SL2::mCherry or tra-2IC::SL2::mCherry. These constructs 
include the unc-54 3’ UTR. The grl-2 promoter included 862 bp upstream of 
the ATG start codon (this is a smaller region of the promoter than that used 
in ref. 34). We checked transgenes driven by this promoter for embryonic 
expression and saw no expression before the birth of the AMso. The ztf-16 glia 
enhancer included the region −2,536 to −4,637 upstream of the ATG, as 
described in ref. 33. 

The following arrays were generated: oleEx18[grl-2::fem-3::SL2mCherry 
(20 ng µl⁻¹) + elt-2::gfp (40 ng µl⁻¹)]; oleEx24[grl-2::fem-3::SL2mCherry (8 ng µl⁻¹) + 
elt-2::gfp (40 ng µl⁻¹)]; oleEx19[grl-2::tra-2IC::SL2mCherry (50 ng µl⁻¹) + elt-2::gfp 
(30 ng µl⁻¹)]; oleEx23[grl-2::tra-2IC::SL2mCherry (10 ng µl⁻¹) + elt-2::gfp (40 ng 
µl⁻¹)]; oleEx22[ztf-16::tra-2IC::SL2mCherry (5 ng µl⁻¹) + elt-2::gfp (40 ng µl⁻¹)]. 
Laser ablations. A standard protocol was used. L1, L3 or L4 males were 
mounted on a glass slide on a 5% agarose pad with 25 mM NaN₃ as anaesthetic. 
For MCM candidate progenitor ablations the presence or absence of MCMs was 
identified with a rab-3 transgene (Extended Data Table 1). Epithelial cells were 
identified by morphology, position, and expression of the gaIs220 [col-93::HIS- 
24::mCherry + unc-119(+)] or syIs78[ajm-1::gfp+ unc-119(+ )] transgenes. Amphid 
socket cells were identified with glial markers (Extended Data Table 5). 

For MCM ablation and behavioural assays the strains BL5717, 
inIs179(ida-1::gfp);him-8(e1489) and BAR4, otIs356(rab-3::rfp)him-5(e1490) were used. 
Animals were left to recover for one day and then assayed. 

Behavioural assays. Sexual conditioning switch. Assays were performed as in ref. 
10 with some modifications. Assay plates and 50 mM NaCl conditioning plates 
were 5 cm in diameter and contained 4.5 ml of 2% agar, 5 mM potassium 
phosphate (pH 6.0), 1 mM CaCl₂ and 1 mM MgSO₄ (salt-conditioning plates also 
contained 50 mM NaCl). A radial gradient of salt was created in the assay plates 
by adding 10 µl of 50 mM NaCl 2 cm away from the centre of the plate the night 
before the assay and then 5 µl of 50 mM NaCl three hours before the assay. Males 
and hermaphrodites were picked as L4s the night before the assay and transferred 
to single-sex plates with food. Animals were recovered from the food plates with 
wash buffer (1 mM CaCl₂, 1 mM MgSO₄, and 5 mM pH 6.0 potassium 
phosphate) and centrifuged at 1,700 r.p.m. for 3 min. A total of three washes were 
performed to remove the food before placing the animals on the conditioning 
plates. Between 8 and 15 males were placed on each conditioning plate and 200 
hermaphrodites were used for sexual conditioning. Animals were conditioned for 
2h. Ablated males were sexually conditioned in the same plate as control males, 
tested blind, and identified afterwards based on the absence of MCMs. Animals 
were tested individually on a radial gradient of salt by placing them in the centre 
of the assay plate, 2 cm away from the source of NaCl. After 20 min, the tracks left 
by the animal were visualized and the chemotaxis score was calculated as in ref. 10 
by the sum of the scores of the regions by which the animal had travelled (Fig. 3a). 
At least five replicates were performed, on different days. Mann-Whitney 
U-test was used for statistical analysis. 
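
The scoring and comparison steps of this assay can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' code: the region labels and their numeric scores are placeholders (the actual scheme is defined in Fig. 3a and ref. 10), each visited region is assumed to count once, and the function names are hypothetical. Only the Mann-Whitney U statistics are computed; obtaining a P value would require a statistics package or exact tables.

```python
# Placeholder region scores. The real values are given in Fig. 3a and ref. 10.
REGION_SCORES = {"source": 2, "near": 1, "centre": 0, "far": -1}


def chemotaxis_score(visited_regions, region_scores):
    """Sum the scores of the distinct regions crossed by an animal's track."""
    return sum(region_scores[region] for region in set(visited_regions))


def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b), assigning midranks to tied values."""
    pooled = sorted(list(sample_a) + list(sample_b))
    midrank = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        # Tied values occupy 1-based ranks i+1 .. j; their mean is (i + 1 + j) / 2.
        midrank[pooled[i]] = (i + 1 + j) / 2
        i = j
    n_a, n_b = len(sample_a), len(sample_b)
    rank_sum_a = sum(midrank[value] for value in sample_a)
    u_a = rank_sum_a - n_a * (n_a + 1) / 2
    return u_a, n_a * n_b - u_a


if __name__ == "__main__":
    # Hypothetical track and hypothetical score lists, for illustration only.
    print(chemotaxis_score(["centre", "near", "source", "near"], REGION_SCORES))
    print(mann_whitney_u([1, 2, 3], [4, 5, 6]))
```

The smaller of the two U values is the one usually compared against the critical value for the given sample sizes.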

Response to secreted mate pheromones. Assays were performed as in ref. 22. 
Hermaphrodite-conditioned media was prepared by incubating wild-type or 
daf-22(m130) hermaphrodites in M9 buffer at a concentration of one 
hermaphrodite per 1 µl of M9 for 3 h. Animals were picked as L4s the night before. Assay 
plates were regular NGM plates seeded with 50 µl of OP50. A pheromone spot 
(1 µl of hermaphrodite-conditioned M9, or dilutions of Ascr no. 2 and Ascr no. 3 
in M9) and a control spot (1 µl of M9) were placed 3 mm apart. The two spots 
were interchanged every other trial to remove any bias. Upon drying, the spots left 
a visible rim on the food lawn. A population of 5 males (8 males for him-5(e1490) 
and pdf-1(tm1996) trials) was placed 3 mm away from the spots and video 
recorded for 20 min. Videos were scored blindly for time spent in each spot. 
Each entrance to a spot was considered an independent event if after leaving, 
the worm had moved at least 2 mm away from the spot. The average duration of 
events was calculated and compared between spots and populations. Mann- 
Whitney U-test with Bonferroni correction was used for statistical analysis. For 
clear visualization of the data in the plotted graphs, data was logarithmically 
transformed. This results in normalization of the distribution of the data. t-test 
with Bonferroni correction was used for statistical analysis of the transformed 
data. Comparison of arithmetic (raw data) or geometric (transformed data) 
means did not affect the interpretation of the data. 
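
The event definition and the transformations described above can be sketched in Python. The sketch makes several assumptions that are not stated in the text: each visit is recorded as an entry time, an exit time and the furthest distance reached from the spot before the next entry; visits that are merged into one event contribute only their in-spot time; and the function names are hypothetical. The 2 mm threshold is the value given in the text.

```python
import math

MIN_EXCURSION_MM = 2.0  # threshold stated in the text


def event_durations(visits):
    """Merge chronological visits into independent events and return their durations.

    Each visit is (enter_s, exit_s, excursion_mm), where excursion_mm is the
    furthest distance from the spot reached after leaving (assumed input format).
    """
    durations = []
    current = 0.0
    for enter_s, exit_s, excursion_mm in visits:
        current += exit_s - enter_s
        if excursion_mm >= MIN_EXCURSION_MM:
            # The worm moved far enough away, so the next entry starts a new event.
            durations.append(current)
            current = 0.0
    if current > 0:
        durations.append(current)  # recording ended before a qualifying excursion
    return durations


def geometric_mean(values):
    """Mean of log-transformed values, back-transformed to the original scale."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def bonferroni(p_value, n_comparisons):
    """Bonferroni-adjusted P value, capped at 1."""
    return min(1.0, p_value * n_comparisons)


if __name__ == "__main__":
    # Hypothetical visits, for illustration only.
    visits = [(0, 10, 0.5), (12, 20, 3.0), (40, 45, 5.0)]
    durations = event_durations(visits)
    print(durations, sum(durations) / len(durations), geometric_mean(durations))
```

In this example the first two visits are merged because the worm moved only 0.5 mm away between them, giving two events rather than three.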

Seven independent population trials were performed for each condition except 
for daf-22(m130)-conditioned media (twelve), 600 nM Ascr no. 2 + 60 nM Ascr 
no. 3 (five) and 1,200 nM Ascr no. 2 + 120 nM Ascr no. 3 (four). We selected three 
different ascaroside dilutions according to the response elicited in intact males: 
600 nM Ascr no. 2 + 60 nM Ascr no. 3 (non-robust, stochastic response), 
800 nM Ascr no. 2 + 80 nM Ascr no. 3 (robust but low response), 1,200 nM 
Ascr no. 2 + 120 nM Ascr no. 3 (robust and strong response) (Extended 
Data Fig. 2). 

Response efficiency to mate contact. Animals were isolated as L4s the night before. 
For the assay, single males were placed with 30 mates (hermaphrodites) in a 
20 mm food lawn. Males were tested until they responded to a mate or for three 
minutes, whichever happened first. A male was scored as responding to mate 
contact if it placed its tail ventral down on the mate’s body and initiated the 
mating sequence by backing along the mate’s body to make a turn. The response 
efficiency was calculated as one response divided by the total number of contacts 
made with the mate before responding. If a male did not respond within three 
minutes, it was scored as having 0 efficiency. Males were scored blindly. The 
Mann-Whitney U-test was used for statistical analysis. 
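
As a worked illustration of the response-efficiency score, the following Python sketch encodes the rule described above. It assumes that the contact count includes the contact at which the male responded, so that the smallest possible count is 1; the function name is hypothetical.

```python
def response_efficiency(contacts, responded):
    """Return 1 / contacts for a responding male, or 0 if he never responded.

    `contacts` is assumed to include the contact that triggered the response.
    """
    if not responded:
        return 0.0  # no response within the three-minute window
    if contacts < 1:
        raise ValueError("a responding male must have made at least one contact")
    return 1.0 / contacts


if __name__ == "__main__":
    # Hypothetical males, for illustration only.
    print(response_efficiency(4, True))   # responded at the fourth contact
    print(response_efficiency(6, False))  # never responded
```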

Turning and location of vulva (lov) assays. Single males (isolated the night before 
as L4) were tested on a 10 µl lawn of food (E. coli OP50) with 30 unc-51 
hermaphrodites for 3 min. Upon response to contact, up to seven turns were 
scored as good or bad. Good turns: sharp ventral bend around the tip of the body 
of the mate, followed by uninterrupted scanning along the other side of the mate’s 
body. Bad turns: stutter (stops moving backwards before turning and moves 
forward before continuing with backward move); wide (tail positioned in a loose 
bend when turning around the tip of the mate); swim off (continues backing after 
reaching the end of the mate’s body and loses contact); or interrupted (after 
turning, the male does not continue backing along the mate’s body). The pro- 
portion of good turns out of total turns was scored. 

For location of vulva efficiency, the number of encounters with the vulva until 
it was first located was scored. The assay finished when the male located the vulva 
or after three minutes, whichever happened first. Location efficiency was scored as 
1 divided by the total number of encounters. 

The Mann-Whitney U-test was used for statistical analysis. 
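
For readers unfamiliar with the test, the following sketch computes the Mann-Whitney U statistic for two groups of efficiency scores, using midranks for tied values. It is a minimal illustration, not the analysis pipeline used here: the function name and the scores are hypothetical, and no P value is derived.

```python
def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b) for two samples, assigning midranks to ties."""
    values = list(sample_a) + list(sample_b)
    # Sort values while remembering each value's original position.
    pooled = sorted((value, idx) for idx, value in enumerate(values))
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        midrank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[pooled[k][1]] = midrank
        i = j + 1
    n_a, n_b = len(sample_a), len(sample_b)
    rank_sum_a = sum(ranks[:n_a])
    u_a = rank_sum_a - n_a * (n_a + 1) / 2
    u_b = n_a * n_b - u_a
    return u_a, u_b


# Hypothetical efficiency scores; not data from the study.
intact = [1.0, 0.5, 0.5, 0.25]
ablated = [0.5, 0.25, 0.2, 0.0]
print(mann_whitney_u(intact, ablated))  # (13.5, 2.5)
```

The smaller of the two U values is the one conventionally compared with critical values or converted to a P value.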

Fertility assays. A single rab-3::rfp transgenic male was left to mate with two unc-51 
hermaphrodites on a 15-µl lawn of food (E. coli OP50) for 24 h and then 
removed. The hermaphrodites were transferred to a fresh plate every 24 h for a 
total of three days. The proportion of cross-progeny out of total progeny was 
calculated for three ablated individuals and three controls. A t-test was used for 
statistical analysis. 
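
As an illustration of the fertility measure, the sketch below pools progeny counts over the three daily plates and returns the proportion of cross-progeny. The function name and the counts are hypothetical, and how cross- and self-progeny are told apart is left outside the sketch.

```python
def cross_progeny_proportion(daily_counts):
    """daily_counts: list of (cross_progeny, self_progeny), one pair per 24-h plate."""
    cross = sum(c for c, _ in daily_counts)
    total = sum(c + s for c, s in daily_counts)
    if total == 0:
        raise ValueError("no progeny were counted")
    return cross / total


# Hypothetical counts for one male over three days; not data from the study.
plates = [(30, 10), (45, 15), (15, 5)]
print(cross_progeny_proportion(plates))  # 0.75
```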

Food-leaving and retention assays. As described in ref. 27. 

Electron microscopy and serial reconstruction. Four samples were fixed by 
chemical fixation or high-pressure freezing and freeze substitution as previously 
described. Ultrathin sections were cut using an RMC Powertome XL, collected 
onto grids, and imaged using either a Philips CM10 TEM or a Zeiss Supra 40 FE-SEM. 
Sections were elastically aligned and volumetrically reconstructed using 
TrakEM2 (ref. 51). The MCM cell bodies were identified in the EM sections based 
on position and morphology and in comparison to similar hermaphrodite 
sections. This identity was further confirmed by the identification of a posterior 
projection, as no other cell with its soma in the same region is known to project 
posteriorly. This was followed by serial tracing of the projections to establish their 
morphology and connectivity. Synaptic connectivity and skeleton diagrams were 
determined using Elegance. Circuit diagrams of connectivity were generated 
using Cytoscape. We estimated the anatomical strength of synaptic connectivity 
between two neurons by summing the number of serial sections in which we 
observed the ultrastructural morphology of presynaptic components, using the same 
criteria as ref. 13. 
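
The strength estimate amounts to a sum over synapses for each directed pair of neurons. The sketch below shows that bookkeeping under the assumption that each synapse is recorded with the number of serial sections it spans. The function name and the per-synapse counts are hypothetical.

```python
from collections import defaultdict


def anatomical_strength(synapse_observations):
    """Sum, per directed neuron pair, the EM sections in which each synapse was seen."""
    strength = defaultdict(int)
    for pre, post, n_sections in synapse_observations:
        strength[(pre, post)] += n_sections
    return dict(strength)


# Hypothetical per-synapse records: (presynaptic, postsynaptic, sections spanned).
observations = [
    ("RIM", "MCM", 1),
    ("RIM", "MCM", 1),
    ("RIM", "MCM", 1),
    ("MCM", "CEM", 3),
    ("MCM", "CEM", 5),
]
print(anatomical_strength(observations))
# {('RIM', 'MCM'): 3, ('MCM', 'CEM'): 8}
```

Under this scheme a pair joined by a few large synapses can score higher than a pair joined by many small ones, which is why the synapse count and the section count are reported separately.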
Extended Data Figure 1 | The MCMs are newly identified male-specific 
neurons. a, WormAtlas-style diagram depicting the morphology and position 
of one of the bilateral pair of MCM neurons in the head of a male worm and its 
projection within the nerve ring and along the ventral cord. b, Volumetric 
reconstruction of the MCML cell body and projection based on tracing of serial 
EM sections. c, Co-expression of transgenes for neuronal markers in the 
rab-3-positive cells identified as MCMs (indicated with dashed red circles). All 
photographs are lateral views of animals oriented anterior to the left and dorsal 
to the top except for ric-19, which are dorsal views. Transgenes are listed in 
Extended Data Table 1. pdf-1 (neuropeptide pigment dispersing factor); snb-1 
(synaptobrevin); ida-1 (tyrosine phosphatase-like receptor, orthologue of 
mammalian phogrin); ric-19 (rab-2 effector); nca-1 (NALCN Na⁺ channel 
subunit); ccb-1 (voltage-gated Ca²⁺ channel subunit); unc-36 (voltage-gated 
Ca²⁺ channel subunit); inx-3 (gap junction innexin). D, dorsal; L, left; R, 
right; V, ventral. d, Diagram of the neurons that directly connect to and from 
the MCMs. Triangles, sensory neurons; octagons, interneurons and 
unidentified neurons. The thickness of the arrows is proportional to the 
anatomical strength of the connections (Extended Data Table 2). 
Extended Data Figure 2 | The MCMs are not required for other male-specific 
behaviours. a, Response of intact and MCM-ablated males 
(inIs179(ida-1::gfp);him-8(e1489) and otIs356(rab-3::rfp) him-5(e1490)) to 
dilutions of ascaroside pheromones (Ascr). Graphs represent Tukey box plots 
of logarithmic transformations of the data; n, number of independent events 
(that is, entry in scoring region). t-test with Bonferroni correction was used 
for statistical analysis. ***P < 0.001; **P < 0.01; *P < 0.05; n.s., no statistically 
significant difference (P ≥ 0.05). b, Response efficiency to mate contact of 
intact, MCM-ablated and pdf-1(tm1996) mutant males measured as the 
proportion of responses out of total contacts with a hermaphrodite. Intact and 
MCM-ablated animals were inIs179(ida-1::gfp);him-8(e1489). Wild-type 
animals were him-5(e1490). A response indicates that the male placed its tail 
ventral side down on the mate’s body and backed along it to make a turn. 
c, d, Proportion of good turns (c) and location of vulva efficiency (d) of 
intact and MCM-ablated males (inIs179(ida-1::gfp);him-8(e1489) and 
otIs356(rab-3::rfp) him-5(e1490)). e, Fertility (measured as proportion of 
cross-progeny) of intact and MCM-ablated males (otIs356(rab-3::rfp) him-5(e1490)). 
For b-e, n, number of individual animals tested. Error bars indicate s.e.m. 
Mann-Whitney U-test was used for statistical analysis. *P < 0.05; n.s., 
no statistically significant difference (P ≥ 0.05). f, Mate-searching behaviour, 
measured as P_L values (probability of leaving food per hour) in the absence 
or presence of mates, of intact and MCM-ablated males 
(otIs356(rab-3::rfp) him-5(e1490)). n, number of individual animals tested. Two independent population 
assays were performed on different days. Maximum likelihood statistical 
analysis was used to compare P_L values. Error bars indicate s.e.m. ***P < 0.001; 
n.s., no statistically significant difference (P ≥ 0.05). 
Extended Data Figure 3 | The MCMs arise from a division of the AMso glial 
cell. All photographs are lateral views of animals oriented anterior to the left 
and dorsal to the top. a, Fluorescent photographs showing the two cells 
expressing rnr-1::gfp co-labelled with the glial marker ptr-10::rfp and the 
neuronal marker rab-3::rfp in the head of males at the early and late L4 stages. 
The AMso and MCM cell bodies are indicated with dashed lines. b, Fluorescent 
images of the AMso cell body and its projection at two time points during 
cell division. Photos are overexposed for visualization of the projection, 
indicated by arrows. The chromosomes are labelled with a histone::rfp 
transgene, and the AMso cell body is indicated by dashed lines. 
Extended Data Figure 4 | AMso plasticity is regulated by AMso genetic sex. 
a, Diagram of the AMso and MCM lineage. b, c, Proportion of individuals 
with MCMs in control animals and animals expressing sex-reversing 
transgenes in AMso. b, AMso masculinization with grl-2::fem-3::SL2::mCherry 
transgenes (oleEx18 and oleEx24). c, AMso feminization with 
grl-2::tra-2IC::SL2::mCherry transgenes (oleEx19 and oleEx23) and 
ztf-16::tra-2IC::SL2::mCherry transgene oleEx22. MCM cell fate was identified 
with ida-1::gfp or rab-3::yfp reporter transgenes. In the head, the grl-2 promoter 
drives expression in AMso and the excretory duct and pore cells, and the ztf-16 
glial enhancer drives expression in the AMso and amphid sheath glia. # indicates 
an independent transgenic array line for each manipulation. χ² test was used for 
statistical analysis; ***P < 0.001; n.s., no statistically significant difference 
(P ≥ 0.05); n = number of animals scored. 


©2015 Macmillan Publishers Limited. All rights reserved 


[Figure: a, bar chart of the proportion of MCMs expressing the glial marker (ptr-10prom:myrRFP) or the neuronal marker (ida-1prom:GFP) at the early L4, late L4, 1-day adult and 2-day adult stages; n (cells) = 40, 40, 40, 40 and 46, 118, 50, 42. b, electron micrograph with the body wall muscle labelled.] 

Extended Data Figure 5 | The MCMs lose molecular and structural 
characteristics of glia after birth. a, Proportion of MCMs with presence of the 
glial marker ptr-10::myrRfp or the neuronal marker ida-1::gfp at different stages 
after MCM birth. b, Electron micrograph of a cross-section of an adult male 
head showing the MCM and AMso cell body ultrastructure. Neighbouring 
tissues are colour coded following WormAtlas 
(http://www.wormatlas.org/colorcode.htm): pharynx (purple), muscle (green), 
hypodermis (light cream), AMso (amphid socket, pink). The dendrites of the 
amphid neurons (amphid bundle) are not coloured. 
Extended Data Table 1 | Reporter transgenes for neuronal markers tested for MCM expression 

protein/function | gene | array | MCM expression 
Neuronal GTPase | rab-3 | otIs291 (rab-3prom::NLS YFP+rol-6) and otIs356 (rab-3prom::rfp) | + 62/62 
Neuronal GEF | rgef-1 | evIs111 (rGEFprom::GFP) | + 
Neuropeptide | pdf-1 | myEx696 (pdf-1prom::RFP+unc-122::GFP) | + 
Phogrin/Exocytosis | ida-1 | inIs179 (ida-1prom::GFP) | + 59/59 
Neuropeptide exocytosis | ric-19 | oe 73 (ric-19prom6::NLS-TagRFP) (gift from Hobert lab) | + 
Syntaxin | unc-64 | otEx4553 (fosmid-based transcriptional reporter) (gift from Hobert lab) | + 
Synaptobrevin | snb-1 | otEx5422 (fosmid-based translational reporter) (gift from Hobert lab) | + 
Synaptotagmin | snt-1 | otEx5924 (fosmid-based transcriptional reporter) (gift from Hobert lab) | + 
Na⁺ channel subunit | nca-1 | hpEx528 (nca-1prom::GFP) (gift from Zhen lab) | + 
Ca²⁺ channel subunit | ccb-1 | sEx10836 (T28F2.5prom::GFP+pCeh361) | + 
Ca²⁺ channel subunit | unc-36 | sEx12299 (C50C3.9prom::GFP+pCeh361) | + 
Innexin | inx-3 | zwEx103 [inx-3prom::GFP+lin-15(+)] | + 
VGLUT | eat-4 | otIs388 (fosmid-based reporter) | − 
Choline transporter | cho-1 | otIs323 (fosmid-based reporter) | − 
VGAT | unc-47 | otIs348 [unc-47prom(300bp)::mCHOPTI::unc54-3'UTR, pha-1(+)] | − 
Intraflagellar transport (marker for ciliated neurons) | osm-6 | mnIs17 (osm-6prom::osm-6:GFP) | − 
Fatty acid hydroxylase (marker for the XXX cell) | daf-9 | dhIs59 (daf-9prom::GFP) | − 
Octopamine synthesis | tbh-1 | nIs107 [tbh-1prom::GFP+lin-15(+)] | − 
Neuropeptide | flp-21 | nyIs80 (flp-21prom::GFP) | − 
Nematocin receptor | ntr-1 | lstEx33 [ntr-1prom::gfp] | − 
Ca²⁺ channel subunit | cca-1 | sEx14060 (C54D2.5aprom::GFP+pCeh361) | − 
K⁺ channel subunit | unc-103 | rgEx[unc-103prom::YFP+pBx1] | − 
Innexin | inx-12 | zwEx112 [inx-12prom::GFP+lin-15(+)] | − 
Innexin | inx-13 | zwEx113 [inx-13prom::GFP+lin-15(+)] | − 
Innexin | inx-5 | zwEx105 [inx-5prom::GFP+lin-15(+)] | − 

+ indicates presence; − indicates absence. For integrated arrays, quantification is indicated. At least 15 animals were examined for each transgene. This Table cites refs 54-68. 
Extended Data Table 2 | MCM connectivity 

type | presynaptic | postsynaptic | #synapses | #EM sections 
chemical | AVF | MCM | 16 | 61 
chemical | AVJ | MCM | 1 | 2 
chemical | AVM | MCM | 1 | 1 
chemical | BDU | MCM | 3 | 7 
chemical | CEM | MCM | 2 | 5 
chemical | CEP | MCM | 1 | [illegible] 
chemical | EF | MCM | 6 | 35 
chemical | MCM | ADA | 1 | 3 
chemical | MCM | ALM | 2 | 3 
chemical | MCM | AQR | [illegible] | 3 
chemical | MCM | AVB | 29 | 61 
chemical | MCM | AVF | 1 | 2 
chemical | MCM | AVH | 1 | 2 
chemical | MCM | BDU | 1 | 3 
electrical | MCM | CEM | 1 | 1 
chemical | MCM | CEM | 5 | 8 
chemical | MCM | CEP | 1 | 1 
chemical | MCM | EF | 2 | 3 
chemical | MCM | MCM | 2 | 6 
chemical | MCM | PVC | 1 | 2 
chemical | MCM | PVN | 3 | 3 
chemical | MCM | PVU | 1 | [illegible] 
chemical | MCM | RIC | 2 | 3 
chemical | MCM | RID | 1 | 3 
chemical | MCM | RIF | 11 | 23 
chemical | MCM | RIM | 3 | [illegible] 
chemical | MCM | RIPR | 3 | 6 
chemical | MCM | RMD | 1 | 2 
chemical | MCM | SIA | 1 | 2 
chemical | MCM | SMB | 2 | 3 
chemical | MCM | unk1* | 2 | 4 
chemical | MCM | unk10* | 1 | 2 
chemical | MCM | unk11* | 1 | 2 
chemical | MCM | unk12* | 1 | 1 
chemical | MCM | unk13* | 1 | 1 
chemical | MCM | unk2* | 1 | 3 
chemical | MCM | unk3* | 1 | 2 
electrical | MCM | unk6* | 1 | 1 
chemical | MCM | unk7* | 3 | 7 
chemical | MCM | unk8* | 4 | 4 
chemical | MCM | unk9* | 1 | [illegible] 
chemical | PVC | MCM | 1 | 3 
chemical | PVQ | MCM | 1 | 4 
chemical | PVS | MCM | 1 | 2 
chemical | RIC | MCM | 1 | 1 
chemical | RID | MCM | 1 | 1 
chemical | RIF | MCM | 10 | 20 
chemical | RIM | MCM | 3 | 3 
chemical | RIG | MCM | 1 | 2 
chemical | SMB | MCM | 2 | 3 
chemical | unk14* | MCM | 3 | 15 
chemical | unk15* | MCM | 1 | 5 
chemical | unk16* | MCM | 1 | 1 
chemical | unk4* | MCM | 2 | 3 
chemical | unk5* | MCM | 1 | 1 

*unk refers to neurons whose identity has not been confirmed unambiguously in the EM serial sections. 
Extended Data Table 3 | Cell ablations of candidate MCM progenitors 

ablated cell | stage | animals with loss of MCM/total 
AMso | L3 | 6/7 
AMso | L4 | 0/3 
H0 seam cell | L1 | 1/16 
H1 seam cell | L1 | 0/2 
H0+H1 | L1 | 0/1 
Hyp1 (lateral nucleus) | L1 | 0/4 
H0+Hyp1 (lateral nucleus) | L1 | 0/1 
Extended Data Table 4 | Mosaic analysis of sex-transformation arrays, scoring the presence of MCMs 

sex reversal | array | array in AMso and other cells | array in other cells only 
Masculinised hermaphrodites | oleEx18 (grl-2prom::fem-3:SL2:mCherry) | 89% (n=28) | 0% (n=22) 
Feminised males | oleEx19 (grl-2prom::tra-2IC:SL2:mCherry) | 0% (n=39) | 100% (n=7) 
Feminised males | oleEx22 (ztf-16prom::tra-2IC:SL2:mCherry) | 2.5% (n=40) | 87.5% (n=16) 

*Expression of oleEx18 (grl-2::fem-3::SL2::mCherry) in the head was observed in AMso, excretory duct and pore cells and sometimes in the pharynx and/or hypodermis. Expression of oleEx19 (grl-2::tra-2IC::SL2::mCherry) in the head was observed in AMso, excretory duct and pore cells and sometimes in the hypodermis. Expression of oleEx22 (ztf-16::tra-2IC::SL2::mCherry) in the head was observed in AMso and sometimes in the amphid sheath and/or a neuron in the nerve ring. 
Extended Data Table 5 | Reporter transgenes for glial/AMso markers 

protein/function | gene | array | glial subtype expression | AMso expression 
Patched (PTCHD3) related receptor | ptr-10 | nsIs108 [ptr-10prom::myristyl-Rfp] | All glia | + 
Hedgehog-like/Ground-related | grl-2 | sEx12852 [T16G1.8prom::GFP+pCeh361] | AMso and PHso | + 
Inositol trisphosphate receptor | itr-1 | jwEx51 (itr1promB::GFP+rol-6) and nsEx1153 [F16F9.3prom::mCherry+itr-1prom::CFP+rol-6(su1006)] | AMso | + 
C2H2 zinc-finger transcription factor | ztf-16 | oleEx22 [ztf-16enhancer::tra-2IC::SL2cherry+elt-2::GFP] | AMso, AMsh, PHso | + 
Basic helix-loop-helix transcription factor | hlh-17 | leEx1713 [hlh-17prom::GFP+unc-119(+)] | CEPsh | − 
Caspr (Neurexin superfamily) | itx-1 | otEx[W03D8.6prom::GFP+rol-6] | OL and IL glia | − 

+ indicates presence; − indicates absence. At least 15 animals were examined for each transgene. AMso, amphid socket; PHso, phasmid socket; AMsh, amphid sheath; PHsh, phasmid sheath; CEPsh, cephalic sensilla sheath; OL, outer labial; IL, inner labial. This Table cites refs 34, 35, 69-72. 
Molecular basis of ligand recognition 
and transport by glucose transporters 

Kunio Hirata, Masaki Yamamoto, Shilong Fan & Nieng Yan 


The major facilitator superfamily glucose transporters, exemplified by human GLUT1-4, have been central to the study 
of solute transport. Using lipidic cubic phase crystallization and microfocus X-ray diffraction, we determined the 
structure of human GLUT3 in complex with D-glucose at 1.5 Å resolution in an outward-occluded conformation. 
The high-resolution structure allows discrimination of both α- and β-anomers of D-glucose. Two additional 
structures of GLUT3 bound to the exofacial inhibitor maltose were obtained at 2.6 Å in the outward-open and 2.4 Å in 
the outward-occluded states. In all three structures, the ligands are predominantly coordinated by polar residues from 
the carboxy terminal domain. Conformational transition from outward-open to outward-occluded entails a prominent 
local rearrangement of the extracellular part of transmembrane segment TM7. Comparison of the outward-facing GLUT3 
structures with the inward-open GLUT1 provides insights into the alternating access cycle for GLUTs, whereby the 
C-terminal domain provides the primary substrate-binding site and the amino-terminal domain undergoes rigid-body 
rotation with respect to the C-terminal domain. Our studies provide an important framework for the mechanistic and 
kinetic understanding of GLUTs and shed light on structure-guided ligand design. 


Cellular uptake of glucose is a fundamental process for metabolism, 
growth and homeostasis. The SLC2 family glucose transporters 
(GLUTs) catalyse facilitative diffusion of glucose and other 
monosaccharides across biomembranes. The fourteen human GLUTs have 
specific spatial and temporal distributions and exhibit distinct 
transport kinetics, capacity and substrate selectivity. 

GLUT1-4 are among the most rigorously characterized solute 
transporters. GLUT1, which was one of the first transporters to be 
characterized, has been a paradigm in the understanding of solute 
transport. GLUT1 is the principal glucose transporter in 
erythrocytes and blood-tissue barriers, and is ubiquitous for basal-level glucose 
uptake. GLUT2 represents the major hepatocyte isoform that 
allows both uptake and efflux of glucose in response to the fed or fasted 
state. GLUT2 also mediates glucose transport in intestinal, kidney and 
β-pancreatic cells. GLUT3 is referred to as the ‘neuronal glucose 
transporter’ for its primary function in neurons, and it is also 
responsible for glucose uptake in sperm, preimplantation embryos and 
circulating white blood cells. GLUT4 is responsive to insulin in 
adipocytes and muscles. 

Inactivating mutations or mis-regulation of GLUTs are associated 
with deleterious diseases including GLUT1 deficiency syndrome 
(De Vivo disease), Fanconi-Bickel syndrome, type 2 diabetes mellitus 
and Alzheimer’s disease. GLUT1 and GLUT3 are overexpressed 
in different types of solid tumours, where the demand for glucose is 
strikingly enhanced to compensate for ATP generation under anaerobic 
conditions (the Warburg effect). Targeting the overexpression of 
GLUTs for cancer diagnosis and potential therapy has drawn increasing 
attention, exemplified by positron emission tomography that 
monitors the uptake of 2-deoxy-2-[¹⁸F]fluoroglucose. Glucose 
conjugates have been designed to use glucose transporters for enhanced 
membrane permeation and tissue-specific delivery of anti-cancer 
drugs. Structural determination of GLUTs, particularly in complex 
with ligands, is a prerequisite for ligand design and optimization. 

GLUTs function by the ‘alternating access’ mechanism, whereby 
the alternate exposure of the substrate-binding site(s) to either side 
of the membrane is achieved through cycles of conformational 
changes of the transporter. The structure of GLUT1 was recently 
captured in an inward-open conformation. Structures of XylE, the 
Escherichia coli homologue of GLUTs, are available in three states: 
ligand-bound and outward-occluded, inward-open, and partly 
inward-occluded. 

Here we present the in meso crystal structure of human GLUT3 
bound to D-glucose in an outward-occluded state at 1.5 Å resolution. 
Structures of GLUT3 in complex with the exofacial competitive 
inhibitor maltose were also obtained in the outward-open and 
outward-occluded conformations. Structural comparison with the 
inward-open GLUT1 reveals the molecular basis for substrate 
recognition and transport by GLUTs. 


Characterizations of the recombinant GLUT3(N43T) 


All the functional and structural characterizations of GLUT3 reported 
here were performed using the glycosylation-site-eliminated variant 
GLUT3(N43T), which will be referred to as GLUT3 hereafter. A 
proteoliposome-based counterflow assay was reconstituted following 
a modified protocol, and the substrate selectivity of the 
recombinant GLUT3 was examined qualitatively (Fig. 1a, b). Among fifteen 
tested hexoses and pentoses, D-glucose, D-galactose, D-mannose, 
D-xylose and D-fucose blocked more than 90% of the [³H]glucose 
uptake, and L-arabinose and D-lyxose inhibited more than half of 
the activity. The other sugars had little or no effect on the transport. 
Notably, the C3 hydroxyl (C3-OH) of the seven effective inhibitors 
exhibits the same equatorial configuration, suggesting that C3-OH 
Figure 1 | Structure of human GLUT3(N43T) bound to D-glucose. 
a, b, Substrate specificity of GLUT3. The transport of D-[2-³H]glucose by 
recombinant GLUT3(N43T) protein was examined in proteoliposome-based 
counterflow assays in the presence of the indicated monosaccharides (a) and 
chemicals (b). All the experiments were independently repeated at least 
three times. Error bars represent s.d. L-DHA, L-dehydroascorbic acid; GlcN, 
glucosamine; GlcNAc, N-acetylglucosamine; G6P, glucose-6-phosphate. 
c, Overall structure of GLUT3 in the presence of D-glucose. GLUT3 is 
domain-coloured with pale green, pale cyan and pale yellow for the N-terminal, 
C-terminal and ICH domains, respectively. The ‘omit’ electron density of the 
bound ligand, shown as brown mesh, is contoured at 3σ. d, The structure 
of glucose-bound GLUT3 exhibits an outward-occluded conformation. The 
corresponding TMs in the four 3-TM repeats are coloured the same. The ligand 
is shown as black spheres. The surface electrostatic potential was calculated 
with PyMol. All structure figures were prepared with PyMol. 


may be crucial for specific recognition by GLUT3. Supporting this 
notion, D-ribose, the C3 epimer of D-xylose, showed no inhibitory 
effect on the uptake of glucose (Fig. 1a and Extended Data Fig. 1a). 

Among the chemicals tested, L-dehydroascorbic acid, which was 
suggested to be a substrate for GLUTs, effectively inhibited 
the counterflow of D-glucose by GLUT3. Addition of glucosamine, a 
reported substrate of GLUT2 (ref. 40), blocked up to 60% of the 
activity. In contrast, myo-inositol, N-acetylglucosamine and 
glucose-6-phosphate showed almost no inhibition (Fig. 1b and Extended 
Data Fig. 1b). These results, consistent with characterizations 
performed in distinct systems, verified the activity and selectivity of 
the glycosylation-site-eliminated recombinant GLUT3 protein. 


Structure of GLUT3 in complex with D-glucose 


The crystals of GLUT3 in the presence of D-glucose were obtained in 
the P2₁ space group using the lipidic cubic phase (LCP) method. The 
structure was solved by molecular replacement and refined to 1.5 Å 
resolution (Fig. 1c, Extended Data Fig. 2a-d and Extended Data 
Table 1). The transmembrane region of GLUT3 exhibits a canonical 
major facilitator superfamily (MFS) fold with the 12 transmembrane 
segments (TMs) folded into the N-terminal and C-terminal domains, 
each comprising ‘3 + 3’ inverted repeats. TM7 and TM10 are 
discontinuous helices, hence named TM7a/7b and TM10a/10b, respectively 
(Fig. 1d). The N-terminal and C-terminal domains are connected by 
four helices (IC1-4), which together with the C-terminal helix IC5 
constitute the intracellular helical (ICH) domain. 

After assignment and refinement of the polypeptide chain of 
GLUT3 (residues 1-470), the omit electron density for a bound 
D-glucose was unambiguously recognized in the central cavity 
(Fig. 1c). The ligand is occluded from either side of the membrane 
in the outward-facing GLUT3. Therefore, the structure represents a 
substrate-bound and outward-occluded state (Fig. 1d). In addition to 
D-glucose, three monoolein molecules, which were the major 
components of LCP, were identified (Extended Data Fig. 2e, f). 


Recognition of α- and β-anomers of D-glucose by GLUT3 


Extensive inter-domain interactions constitute the extracellular and 
intracellular gates as well as the side walls that insulate the bound 
ligand from the surrounding environment (Extended Data Fig. 3 
and Supplementary Discussion). The high resolution provides 
unprecedented clarity towards substrate recognition by GLUTs. It is of 
particular interest that the 2Fo − Fc electron density revealed the 
existence of both α- and β-D-glucose anomers. Despite the prevailing 
presence of β-D-glucose in aqueous solution, the α-anomer exhibits a 
dominant occupancy of approximately 69% in the refined structure 
(Fig. 2a, b). 

D-glucose is bound asymmetrically within the central cavity 
(Fig. 2c, d and Extended Data Fig. 4a). Located halfway across the 
membrane height, it stands closer to the C-terminal domain, which 
provides the primary substrate accommodation site in the centre of its 
transport-path-facing surface. Six polar residues from the C-terminal 
domain, including Gln280 and Gln281 on TM7a, Asn286 on TM7b, 
Asn315 on TM8, Glu378 on TM10a, and Trp386 on TM10b, 
contribute eight hydrogen bonds to coordinate D-glucose. With respect to the 
N-terminal domain, D-glucose is closer to TM1 and TM5. Gln159 on 
TM5 is the only polar residue from the N-terminal domain that 
engages in hydrogen bonding to D-glucose (Fig. 2d-f). 

The α- and β-D-glucose anomers are similarly coordinated by 
GLUT3 except for the variation at C1-OH (Fig. 2e, f). The α- and 
β-C1-OH are recognized by Trp386 and Gln280, respectively. The 
ring oxygen and C1-OH in both anomers are hydrogen bonded to the 
side group of Gln159. The C2, C3 and C6 hydroxyls are each 
coordinated by two hydrogen bonds, including those between C6-OH and 
Asn315 and Glu378, C2-OH and Gln280 and Trp386, and C3-OH 
and Gln281 and Asn286. The C4-OH forms a single hydrogen bond 
with Asn286 (Fig. 2e, f). In addition to the polar interactions, the 
carbon backbone of the sugar ring is surrounded by hydrophobic 
residues including Phe24 on TM1, Ile162 and Ile166 on TM5, 
Ile285 and Phe289 on TM7b, and Phe377 on TM10b (Fig. 2g). A 
monoolein molecule was found to participate in hydrogen bonding 
with D-glucose. Although the monoolein molecule should be 
physiologically irrelevant, similar indirect hydrogen bonds between glucose 
and polar residues may be mediated through water molecules under 
physiological conditions (Extended Data Fig. 4b). 
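
The hydrogen-bond inventory described in the text can be tallied per domain as a consistency check. The sketch below encodes only the contacts named in the two preceding paragraphs. The data layout and function name are illustrative, and reading the eight bonds as the C-terminal contacts of a single anomer is an interpretation rather than a statement from the authors.

```python
# Residues named in the text, grouped by the domain they belong to.
C_TERMINAL = {"Gln280", "Gln281", "Asn286", "Asn315", "Glu378", "Trp386"}
N_TERMINAL = {"Gln159"}

# Contacts common to both anomers: (sugar atom or group, residue).
SHARED_BONDS = [
    ("ring O", "Gln159"), ("C1-OH", "Gln159"),
    ("C2-OH", "Gln280"), ("C2-OH", "Trp386"),
    ("C3-OH", "Gln281"), ("C3-OH", "Asn286"),
    ("C4-OH", "Asn286"),
    ("C6-OH", "Asn315"), ("C6-OH", "Glu378"),
]
# The C1-OH partner in the C-terminal domain differs between the anomers.
ANOMER_SPECIFIC = {"alpha": ("C1-OH", "Trp386"), "beta": ("C1-OH", "Gln280")}


def bonds_by_domain(anomer):
    """Return (C-terminal, N-terminal) hydrogen-bond counts for one anomer."""
    bonds = SHARED_BONDS + [ANOMER_SPECIFIC[anomer]]
    c_count = sum(1 for _, residue in bonds if residue in C_TERMINAL)
    n_count = sum(1 for _, residue in bonds if residue in N_TERMINAL)
    return c_count, n_count


print(bonds_by_domain("alpha"))  # (8, 2)
print(bonds_by_domain("beta"))   # (8, 2)
```

Either anomer gives eight bonds from the six C-terminal residues and two from Gln159, in line with the counts quoted above.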


Structures of GLUT3 bound to maltose 

While working on crystallization of glucose-bound GLUT3, we 
attempted to capture the outward-open state of GLUTs by exploiting 
exofacial competitors. Details of this work can be found in the 
Supplementary Discussion and Extended Data Fig. 5. The structures 
of GLUT3 in complex with maltose were captured in two 
conformations, outward-open at 2.6 Å and outward-occluded at 2.4 Å 
resolution, respectively (Fig. 3, Extended Data Fig. 6 and Extended Data 
Table 2). Please refer to the Supplementary Discussion and Extended 
Data Fig. 7 for detailed analysis of maltose coordination in the two 
conformations. Briefly, the maltose-bound, outward-occluded 
structure is virtually indistinguishable from the glucose-bound GLUT3 
with an overall root mean square deviation of 0.25 Å. The second 
glucose unit (Glc2) of maltose completely overlaps with D-glucose 
when the two structures are superimposed. The coordination of 
Glc2 is nearly identical to that of glucose except for Asn286, whose 
side group exhibits a distinct rotamer that recognizes the C3-OH of 
Glc1 (Extended Data Fig. 7a, b). 
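
For reference, a root mean square deviation over matched atoms can be computed as sketched below. The sketch assumes that the two coordinate sets are already superimposed and listed in the same atom order. The function name and the toy coordinates are hypothetical, and the code is not the procedure used to obtain the 0.25 Å value.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation (same units as the input) over paired atoms."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared / len(coords_a))


# Toy coordinates in Å: the second set is the first shifted by 0.25 Å along x.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
shifted = [(x + 0.25, y, z) for x, y, z in reference]
print(rmsd(reference, shifted))  # 0.25
```

In practice the superposition step (for example, a least-squares fit on Cα atoms) determines which residues are paired, so reported values depend on that choice.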




Comparison of the outward-open and -occluded states 
Subtle relative rotation of the N-terminal and C-terminal domains 
towards the central cavity on the extracellular side is observed during 
the state transition from outward-open to outward-occluded. The 
N-terminal domains remain nearly rigid in the two structures, 
whereas prominent local rearrangements occur in TM7b in the 
C-terminal domain (Fig. 4a, b and Supplementary Video 1). 

To achieve the conformational switch from outward-open to 
outward-occluded, TM7b, which consists of four helical turns, is partly 
unwound and bent in the middle such that the first two helical turns 
tilt inwards. Meanwhile, these two helical turns undergo an axial 
rotation of approximately 60 degrees, leading to the relocation of 
the bulky residue Tyr290 and the substrate-coordinating residue 
Asn286 into the transport path (Fig. 4b). Notably, an invariant glycine 
(Gly284 in GLUT3) constitutes the kink preceding TM7b, which 
may provide the flexibility for the pronounced structural shift 
(Supplementary Fig. 1). Whereas the endofacial region of the TM 
domain remains nearly unchanged (Fig. 4a, right panel), the ICH 
domain exhibits minor intra-domain rearrangements with the inward 
Figure 2 | Coordination of the α- and β-D-glucose 
anomers by GLUT3. a, b, Both α- and 
β-anomers of D-glucose are identified in the 
structure of GLUT3. The 2Fo − Fc electron density 
maps, shown as blue mesh, for α- and β-D-glucose 
are contoured at 2σ and 1σ, respectively. c, The 
bound glucose is predominantly coordinated by 
the C-terminal domain. A cut-open side view of the 
semi-transparent surface electrostatic potential is 
shown. d, The position of the bound glucose 
with respect to the C-terminal and N-terminal 
domains. The substrate-facing sides of the 
C-terminal and N-terminal domains are shown 
in surface and cartoon representations. Only the 
α-anomer is shown. e, f, Coordination of α- and 
β-D-glucose by GLUT3 through polar interactions. 
The α- (e) and β- (f) anomers are coloured black 
and silver, respectively. Hydrogen bonds are 
represented by red dashed lines. g, Coordination 
of glucose by GLUT3 through van der Waals 
interactions. The residues from the N-terminal and 
C-terminal domains are coloured green and cyan, 
respectively. 
motion of helices IC1/2/3/5 by 1 to 2 Å during the outward-open to 
outward-occluded transition (Fig. 4c). 


Alternating access 

GLUT3 shares sequence identity of 66% and similarity of 80% 
with GLUT1 (Supplementary Fig. 1). Structural resolution of the 
two closely related GLUTs in three distinct conformations reveals 
the molecular basis for the alternating access cycle. The N-terminal 
domain remains almost rigid during a relative rotation of the two 
domains, while the discontinuous helices TM7/10 and their adjoining 
segments in the C-terminal domain undergo prominent local 
Figure 3 | Structures of maltose-bound GLUT3 in the outward-open and 
outward-occluded conformations. a, Structures of GLUT3 in complex with 
maltose in the outward-open and outward-occluded states. The cut-open 
views of the surface electrostatic potential are shown to compare the contours of 
the central cavities in the two structures. b, The van der Waals surface of the 
two GLUT3 structures was calculated with the program HOLE. The radii of 
the potential transport path are tabulated. The glucose and maltose binding 
zones are indicated by grey shades. Glc, glucose. 
Figure 4 | Conformational changes between the outward-open and 
outward-occluded GLUT3. a, Structural comparison of GLUT3 in two 
conformations. The outward-occluded GLUT3 is domain-coloured and the 
outward-open one is coloured silver. The structures are superimposed relative 
to their N-terminal domains. TM7b and the adjoining segment of TM8 are 
coloured orange in the outward-occluded structure. The ICH domain is 
omitted in the right panel, where an intracellular view of the transmembrane 
domain is shown. Note that the structures of the outward-occluded GLUT3 
bound to maltose and glucose are nearly identical (Extended Data Fig. 7b). We 
therefore use the glucose-bound structure for comparison. b, Conformational 


rearrangements. TM7b is gradually bent towards the central cavity, 
while TM10b swings away from the transport path during the 
outward to inward transition (Fig. 5a-c). 

The rigidity of the N-terminal domain and the adaptability of the 
C-terminal domain may be determined by their distinct structural 
features (Extended Data Fig. 8). The internal core of the N-terminal 
domain is relatively hydrophilic. Seven water molecules were 
identified within the N-terminal domain of the GLUT3 structures. 
A continuous strip of hydrogen bonds connecting the water 

Figure 5 | Alternating access of the glucose-binding site in GLUTs. 

a, Structural comparison of glucose-bound GLUT3 with the inward-open 
GLUT1. The two structures are superimposed relative to their N-terminal 
domains. GLUT3 is domain-coloured and GLUT1 (PDB accession code: 4PYP) 
is coloured silver. Inset, when the N-terminal domains of the two structures are 
superimposed, the bound ligands are not overlapped. For visual simplicity, 
the C-terminal domain of GLUT3 is shown as a thin ribbon. b, Intra-domain 
rearrangements of the C-terminal domains between the inward-open GLUT1 
and outward-occluded GLUT3. The segments that exhibit local structural shifts 
are coloured orange in GLUT1. The structures of GLUT3 and GLUT1 are 
superimposed relative to their C-terminal domains in panels b-e. See 
Supplementary Video 2 for the morph that illustrates the conformational 
changes. c, TM7b and TM10b undergo prominent conformational changes in 
changes of TM7b result in the transition from outward-open to outward- 
occluded. Right panel, an extracellular view of the GLUT3 structures 
superimposed against the N-terminal domain. The outward-open GLUT3 is 
shown in semi-transparent surface representation and TMs 7 and 8 in the 
two structures are shown as ribbon cartoon. See Supplementary Video 1 for 
the morph illustrating the conformational changes between the two states. 

c, Conformational shifts of the ICH domain. When the two structures are 
overlaid relative to the N-terminal domain, the ICH helices exhibit minor intra- 
domain motions indicated by the purple arrows. 


molecules and the polar residues on TM1 and TM4 extends through- 
out the internal core of the N-terminal domain, likely supporting 
the rigidity of the N-terminal domain during conformational shift. 
In contrast, the C-terminal domain is highly hydrophobic. This 
‘greasy’ interior may enable the structural adaptability of the 
C-terminal domain. 

Consistent with the extensive polar interactions with the 
C-terminal domains, the bound D-glucose in the outward-facing 
GLUT3 and the glucoside of n-nonyl-β-D-glucopyranoside (β-NG) 
[Figure 5 panel labels: outward-open GLUT3; outward-occluded GLUT3; inward-open GLUT1; superimposed against N domain; superimposed against ICH] 

the three indicated structures. d, Modelled glucose-binding in the inward-open 
GLUT1. The cut-open view of the semi-transparent surface electrostatic 
potential of GLUT1 is shown. Inset, the N-terminal domain residues may no 
longer engage in glucose-coordination in the inward-open conformation. The 
N-terminal domain residues that contribute to glucose binding in the outward- 
facing GLUT3 are shown as green sticks; the corresponding residues in the 
inward-open GLUT1 are coloured blue. All the residues are numbered as in 
GLUT3. e, Rearrangements of glucose-coordination between the outward- 
occluded GLUT3 and inward-open GLUT1. The C-domain ligand-binding 
residues that exhibit prominent changes between the two states are 

coloured orange in the inward-open conformation. f, Conformational changes 
of the ICH domain. The colour code of the three structures follows that in 
panel c. 
Figure 6 | Updated working model of GLUTs. a, An overview of the resolved 
structures of human GLUT1 and GLUT3. See Supplementary Video 3 for the 
morph that illustrates the conformational changes between the three structures. 
b, Schematic illustration of the alternating access cycle of GLUTs. The 
rearrangements of the substrate-binding site and the local structural shifts of 
TM7b and TM10b during the transport cycle are highlighted. The shaded states 
refer to the structures shown in panel a. The N-terminal, C-terminal and ICH 
domains are coloured green, cyan and yellow, respectively. The structural 
elements that undergo prominent local shifts during state transition are 
highlighted in orange. 


in the inward-facing GLUT1 completely overlap when the C-terminal 
domains, but not N-terminal domains, are superimposed (Fig. 5a, b). 
We modelled β-D-glucose in the structure of inward-open GLUT1 at 
the position of the glucoside of β-NG (Fig. 5d). As all the concerned 
residues are invariant between GLUT1 and GLUT3, we used the 
residue numbering of GLUT3. 

In the inward-open structure, the N-terminal domain residues no 
longer engage in ligand binding (Fig. 5d, inset). Within the C-terminal 
domain, local structural shifts result in rearrangement of the 
substrate-binding site (Fig. 5e). Trp386 loses contact with the ligand 
due to the outward swing of TM10b. Consequently, the ring oxygen 
and α-C1-OH may lose interactions with the protein in this 
conformation (Supplementary Video 2). 

In addition to the local shift of TM7/TM10, the ICH domain also 
undergoes pronounced conformational changes. Because of the pref- 
erential association of helices IC1/2 with the N-terminal domain and 
IC4 with the C-terminal domain, the intervening helix IC3 may 
function like a ‘door closer’ that restrains the opening degree of the 
N-terminal and C-terminal domains on the intracellular side (Fig. 5f 
and Supplementary Video 3). Helix IC5 is missing in the structure of 
GLUT1, implying its intrinsic flexibility due to the potential loss 
of interactions with other ICH helices in the inward-open state 
(Extended Data Fig. 3c). The structural observation supports the 
notion that the ICH domain may serve as a latch to stabilize the 
outward-facing conformation of GLUTs. 
Discussion 


The structural analyses of GLUTs presented here and previously 
(Fig. 6a) provide the molecular basis to address the long-standing 
questions on substrate recognition and transport by GLUTs. The 
discrimination of α- and β-D-glucose by GLUTs and whether 
anomerization is required have been controversial for decades. The 1.5 Å 
structure of glucose-bound GLUT3 reveals that GLUTs can recognize 
both anomers, hence anomerization may not be required for 
D-glucose transport by GLUTs (Fig. 2e, f). 

The structures also provide interpretation for the ‘asymmetry’ of 
ligand binding from the endo- and exofacial sides of GLUTs (Figs 2, 
5 and 6b). The C-terminal domain provides the primary yet partial 
substrate-binding site composed of polar residues including Gln280 
and Gln281 on TM7b, Asn315 on TM8, and Glu378 on TM10a. 
Arrival of the substrate at the primary site from either the exo- or 
endofacial side may induce conformational changes including the 
local shifts of TM7b and TM10b along with the relative rotation of 
the N-terminal and C-terminal domains. Therefore, the substrate- 
binding site may undergo dynamic rearrangements during a trans- 
port cycle (Fig. 6b and Supplementary Video 2). 

The advances in the structural elucidation of GLUT3 and GLUT1 
provide the framework for understanding a wealth of experimental 
data of GLUTs over the last half century and provide the foundation 
for further kinetic and thermodynamic studies of this important 
class of transporters. The high-resolution structures offer important 
insight into substrate selectivity, which may illuminate the rational 
design and optimization of ligands targeting the Warburg effect 
through GLUTs. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Protein purification. The recombinant human glucose transporter GLUT1(N45T) 
was expressed and purified as described previously. A similar protocol was 
applied for the expression and purification of human GLUT3 with minor 
modifications. 

The codon-optimized cDNA of human GLUT3(N43T) was synthesized and 
subcloned into a modified pFastBac1 vector with an N-terminal 10× His tag. 
Following the instructions for the Bac-to-Bac baculovirus expression system 
(Invitrogen), bacmids were generated in DH10Bac cells. The baculoviruses were 
generated and amplified in Sf-9 insect cells. For protein expression and 
purification, the Sf-9 cells were collected 72 h after viral infection, and disrupted 
using a Dounce homogenizer for 80 cycles on ice. The membrane pellets were 
collected and homogenized in the buffer (25 mM MES pH 6.0 and 150 mM NaCl) 
containing protease inhibitors (aprotinin at 0.8 μM, pepstatin at 2 μM, and 
leupeptin at 5 μg ml⁻¹; Amresco), and then solubilized with 2% (w/v) 
n-dodecyl-β-D-maltoside (DDM, Anatrace) at 4 °C for 2 h. The insoluble fraction 
was precipitated by ultracentrifugation (150,000g) for 30 min at 4 °C. The 
supernatant was incubated with Ni-NTA resin (Qiagen) for an additional 30 min 
at 4 °C. The resin was then rinsed three times with the buffer containing 25 mM 
MES pH 6.0, 500 mM NaCl, 30 mM imidazole, and 0.06% (w/v) 
6-cyclohexyl-1-hexyl-β-D-maltoside (CYMAL-6, Anatrace). The protein was 
eluted with the washing buffer plus 300 mM imidazole and further concentrated 
to 10 mg ml⁻¹. For the transport assay, the protein was applied to Superdex 200 
10/300 GL (SD200, GE Healthcare) pre-equilibrated with buffer containing 
25 mM MES pH 6.0, 150 mM NaCl, and 0.06% (w/v) CYMAL-6. Peak fractions 
were collected for the transport assay. For crystallization, 50 mM D-glucose or 
maltose (Sigma) was added throughout the purification procedure. Concentrated 
protein was applied to HiTrap Desalting 5 ml (GE Healthcare) in the SD200 
buffer plus 50 mM D-glucose or maltose. 
Crystallization. GLUT3(N43T), which eluded our extensive crystallization trials 
using a conventional vapour-diffusion method, was crystallized with the lipid 
cubic phase (LCP) approach. To prepare the cubic phase for crystallization trials, 
the protein was concentrated to 30-40 mg ml⁻¹ before mixing with monoolein 
(Sigma) in a 1:1.5 protein-to-lipid ratio (w/w) using a syringe lipid mixer. 

For crystallization of glucose-bound GLUT3, the 40 nl meso phase was mixed 
with 900 nl crystallization buffer for each condition on glass sandwich plates 
(Shanghai FAstal BioTech) using a robot arm Gryphon (ARI). Crystals appeared 
within one week with a typical size of 70 μm × 50 μm × 10 μm. These crystals 
diffracted X-rays to approximately 2.5 Å at SSRF beamline BL17U. Various 
strategies were exploited to optimize crystals. Finally, the mother liquor containing 
28% (v/v) PEG400, 0.1 M HEPES pH 6.8, and 50 mM ammonium citrate gave rise 
to crystals with a size of approximately 140 μm × 100 μm × 20 μm. 

For crystallization trials of maltose-bound GLUT3, 30-45 nl meso phase was 
overlaid with 800 nl of precipitant solution. Crystals appeared overnight under 
two similar conditions. The crystals corresponding to the outward-occluded 
conformation appeared in the crystallization conditions with 38-40% (v/v) 
PEG 400, 100 mM Mg(CHO₂)₂, 50 mM maltose, and 100 mM ADA pH 6.5. 
The crystals corresponding to the outward-open conformation appeared in 
34% (v/v) PEG 400, 400 mM (NH₄)₂HPO₄, 50 mM maltose and 100 mM ADA 
pH 6.9. Crystals grew to a maximum size of about 20 × 20 × 5 μm³ at 20 °C 
within one week. 

The crystals were collected using MicroMesh (M3-L18SP-50; MiTeGen) and 
immediately flash frozen in liquid nitrogen. 

Data collection and processing. The data set of glucose-bound GLUT3 was 
collected at the microfocus beamline BL32XU at SPring-8, Japan. The LCP crys- 
tals of GLUT3 were fast-screened with the shutter free strategy developed by 
SPring-8. Employing the microfocus X-ray beam of size 1 μm × 12 μm and the 
‘helical collection strategy’, we were able to collect a complete data set with a single 
elongated crystal. All diffraction data sets of maltose-bound GLUT3 were also 
collected at the same beamline. There are generally 10-15 crystals in each 
MicroMesh. Diffraction data from 7 and 9 crystals in two different lattices were 
integrated and scaled using the HKL2000 package, respectively. Further processing 
was carried out with programs from the CCP4 suite. Data collection and structure 
refinement statistics are summarized in Extended Data Tables 1 and 2. 

Structure determination and refinement. The phase was solved by molecular 
replacement using PHASER with GLUT1 (PDB code: 4PYP) as the search 
model. The model was first modified by CHAINSAW, then the N-terminal 
domain and C-terminal domain were taken separately as input ensembles for 
PHASER. The model was further rebuilt in COOT and refined with PHENIX. 
The sequence docking was further aided by sequence alignment with GLUT1. 
The percentages of occupancy for α- and β-D-glucose were determined by the 
PHENIX occupancy refinement strategy. The two anomers were considered 
as a constrained occupancy group where the occupancies of atoms in α- and 
β-D-glucose are coupled. The calculated occupancies for the α- and β-anomers 
were 0.69 and 0.31, respectively. For Supplementary Video 1 the intermediate 
morphs were obtained with the multiple-chain morphing script for 
Crystallography & NMR system (CNS). For Supplementary Video 2, a 
homology-based structural model of the inward-open GLUT3 was generated based 
on the structure of GLUT1 using the online SWISS-MODEL workspace. 
Preparation of liposomes and proteoliposomes. Liposomes were prepared as 
described previously. For the counterflow assay, proteoliposomes were prepared 
in solution containing KPM 6.5 buffer (50 mM potassium phosphate pH 6.5, 
2 mM MgSO₄), 20 mg ml⁻¹ pre-extruded phospholipids (E. coli polar lipid 
extract; Avanti), and 50 mM D-glucose. After incubation with 1% 
n-octyl-β-D-glucopyranoside (β-OG; Anatrace) for 60 min at 4 °C, purified 
GLUT1(N45T) or GLUT3(N43T) (10 μg protein per mg lipid) was added and 
incubated for an additional 60 min at 4 °C. β-OG was removed by incubation 
with 240 mg ml⁻¹ Bio-Beads SM2 (Bio-Rad) overnight and an additional 2 h 
incubation with 120 mg ml⁻¹ Bio-Beads. Proteoliposomes were then frozen and 
thawed in liquid nitrogen for five cycles and extruded through membrane filter 
(PC Membranes 0.4 μm, Avanti). In order to remove the excessive glucose, 
proteoliposomes were collected by ultracentrifugation at 100,000g for 1 h and 
washed with ice-cold KPM 6.5 buffer. Finally, the proteoliposomes were 
resuspended in KPM 6.5 buffer to a final concentration of 100 mg ml⁻¹ 
(phospholipids). 

Counterflow assay. For each assay, 2 μl of concentrated proteoliposomes 
(GLUT1 or GLUT3) prepared following the above protocol were added 
into 100 μl KPM 6.5 buffer plus 1 μCi D-[2-³H]glucose (specific 
radioactivity 21.5 Ci mmol⁻¹, PerkinElmer). The final concentration of the external 
D-[2-³H]glucose was 0.46 μM. The uptake of radiolabelled substrates was stopped 
at 30 s by rapidly filtering the solution through 0.22 μm filters (Millipore). The 
filter membranes were washed with 2 ml ice-cold KPM 6.5 buffer immediately, 
solubilized with 0.5 ml Optiphase HISAFE 3 (PerkinElmer) and used for liquid 
scintillation counting with MicroBeta JET (PerkinElmer). Liposomes without 
protein were tested as a negative control. 
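
The quoted external tracer concentration can be checked with a short calculation. The sketch below is illustrative only: it assumes 1 μCi of tracer at 21.5 Ci mmol⁻¹ and a total volume of 102 μl (100 μl buffer plus 2 μl proteoliposomes); the function name is hypothetical and not part of the original protocol.

```python
# Estimate the external D-[2-3H]glucose concentration in one counterflow assay.
# The total volume (buffer plus proteoliposomes) is an assumption of this sketch.

def tracer_concentration_uM(activity_uCi, specific_activity_Ci_per_mmol, volume_ul):
    """Return the tracer concentration in micromolar."""
    # 1 uCi divided by 1 Ci/mmol is 1e-6 mmol, i.e. 1 nmol,
    # so uCi / (Ci/mmol) gives the amount directly in nmol.
    amount_nmol = activity_uCi / specific_activity_Ci_per_mmol
    # nmol per ul equals mmol per litre (mM); multiply by 1000 for uM.
    return amount_nmol / volume_ul * 1000.0

conc = tracer_concentration_uM(1.0, 21.5, 100.0 + 2.0)
print(f"{conc:.2f} uM")
```

Under these assumptions the result rounds to the 0.46 μM stated in the text.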

For competition assays, the indicated sugars and chemicals were added into 
the external KPM 6.5 at 50 mM and the transport reaction was stopped at 30 s. 
The reading of the competition assays was normalized against the one without 
competitor, which had the uptake of radiolabelled glucose set as 100%. All 
counterflow assays were performed at 25 °C and repeated at least three times. 
Error bars represent s.d. 
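
The normalization described above can be sketched as follows. This is a minimal illustration, not the authors' analysis script: the scintillation counts are hypothetical, the function name is invented for the example, and the sample standard deviation is assumed because the text does not say which form of s.d. was used.

```python
from statistics import mean, stdev

def normalize_uptake(control_counts, competitor_counts):
    """Express competitor readings as a percentage of the mean no-competitor control."""
    control_mean = mean(control_counts)
    percents = [100.0 * c / control_mean for c in competitor_counts]
    # Sample standard deviation (n - 1 denominator); an assumption of this sketch.
    return mean(percents), stdev(percents)

# Hypothetical scintillation counts, three replicates each.
control = [9800.0, 10100.0, 10100.0]
with_competitor = [2400.0, 2500.0, 2600.0]

avg, sd = normalize_uptake(control, with_competitor)
print(f"uptake = {avg:.1f}% +/- {sd:.1f}% (s.d., n = {len(with_competitor)})")
```

With at least three replicates per condition, as stated in the text, the standard deviation is defined for every bar.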

Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized. The investigators were not blinded to 
allocation during experiments and outcome assessment. 
Extended Data Figure 1 | Chemical structures of the tested monosaccharides, 
glucose derivatives, and other chemicals in the competition assay. a, b, The 
monosaccharides (a) and chemicals (b) that exhibit potent inhibition to the 
uptake of [2-³H]glucose in the proteoliposome-based counterflow assay are 
labelled brown. Those having moderate or no inhibition are labelled blue and 
green, respectively. 
Extended Data Figure 2 | Protein purification, crystallization and 
structural determination of GLUT3(N43T) in the presence of D-glucose. 
a, A representative chromatogram of the size-exclusion chromatography 
purification of GLUT3. The peak positions were applied for SDS-PAGE and 
followed by Coomassie blue staining. b, The crystal of GLUT3 used for X-ray 
diffraction which led to the final structural determination at 1.5 Å resolution. 
c, A representative image of X-ray diffraction. The inset shows the resolution 
limit beyond 1.5 Å resolution. d, Crystal packing of GLUT3 in the space group 
of P2₁. Three perpendicular views are shown. Each GLUT3 is domain- 
coloured and shown as a ribbon. The bound glucose is shown as a black sphere 
and the bound lipid molecules are shown as blue sticks. e, Electron densities of 
three bound monoolein molecules. The 2Fo − Fc electron density maps for 
the bound lipid molecules are contoured at 1.0σ. f, Positions of the three bound 
monoolein molecules relative to the protein structure. 
Extended Data Figure 3 | The intracellular and extracellular gates of 
GLUT3. a, The N-terminal, C-terminal and ICH domains interact with each 
other through an extensive network of direct and water-mediated hydrogen 
bonds on the intracellular side of the membrane. Water molecules are shown as 
red spheres. Hydrogen bonds are represented by red dashed lines. b, The 
polar interactions between the N-terminal and C-terminal domains on the 
intracellular side. An ICH domain-omitted intracellular view is shown. 
Note the pseudo two-fold symmetry of the overall structure and the interacting 
residues (inset on the right). These polar interactions partially constitute 
the intracellular gate of GLUT3 in the outward-facing conformation. 
c, Intra-domain interaction of the ICH domain. The polar interactions 
between helices IC3 and IC5 are shown. d, The intracellular gate of GLUT3 
involves the Sugar Porter family (SP)-signature motifs. Sequence alignment of 
the 14 human GLUTs was performed with ClustalW. The secondary 
structural elements and the SP motifs are shown on the top and bottom, 
respectively. The residues coloured red are the highly conserved residues that 
constitute the intracellular gate illustrated in panel b. e, The extracellular gate of 
GLUT3 in the outward-occluded state. An extracellular view is shown. f, g, The 
polar and hydrophobic residues mediating the lateral interactions of the 
N-terminal and C-terminal domains. 
Extended Data Figure 4 | D-glucose coordination by GLUT3. a, The 
C-terminal domain provides the primary accommodation site for glucose in 
GLUT3. The α- and β-D-glucose anomers are coloured black and silver, 
respectively. The orange line in the right panel indicates the approximate 
interface between the N-terminal and C-terminal domains viewed from the 
extracellular side. Note that the ligand is located closer to the C-terminal 
domain. b, One monoolein molecule contributes to substrate coordination. The 
two monoolein molecules bound in the cavity of GLUT3 are coloured silver and 
light purple. One monoolein molecule mediates indirect hydrogen bonds 
between C2 and C3 hydroxyl groups of the bound glucose with the side groups 
of Thr28 and Gln281 of GLUT3. The effect of monoolein on glucose binding 
and transport has not been characterized. 
Extended Data Figure 5 | Attempts to obtain the structure of outward- 
facing GLUTs. a, Presence of the detergent molecule β-NG helped stabilize the 
inward-open conformation of GLUT1. b, The glucopyranoside of β-NG and 
D-glucose are coordinated similarly by the inward-facing GLUT1 and the 
outward-facing XylE, respectively. In both structures, the C4-OH is 
positioned towards the extracellular side. c, Identification of potential ligands 
that may stabilize GLUTs in an outward-open conformation. The indicated 
disaccharides, where the C4-OH of D-glucose is condensed with another 
hexose, were tested for their abilities to inhibit glucose transport by GLUT1 
or GLUT3 in the proteoliposome-based counterflow assay. Control refers to 
the conditions where no competitor was added. 
Extended Data Figure 6 | Structure determination of GLUT3 in complex 
with maltose in the outward-open and outward-occluded conformations. 
a, There are two GLUT3 molecules in each asymmetric unit of the outward- 
open structures. b, The crystal packing of the outward-open GLUT3. 
c, Structural superimposition of the two molecules in each asymmetric unit 
of the outward-open GLUT3. The two molecules exhibit nearly identical 
conformations except for the extracellular loop regions. The focus was on 
molecule (Mol) A for structural analysis and comparison in the main text. 
d, The crystal packing of the outward-occluded GLUT3 bound to maltose. 
There is only one molecule in each asymmetric unit. e, f, The 2Fo − Fc 
electron density maps for the bound maltose in the outward-occluded (e) and 
outward-open (f) structures of GLUT3, both contoured at 1σ. 


©2015 Macmillan Publishers Limited. All rights reserved 




Extended Data Figure 7 | Maltose coordination in the outward-open and 
outward-occluded GLUT3. a, Coordination of maltose in the outward- 
occluded GLUT3 structure. Details of the polar interactions of maltose with 
residues from the N-terminal and C-terminal domains are shown in the insets 
on the bottom and right, respectively. Hydrogen bonds are represented by 
brown dashed lines. b, The structure of maltose-bound outward-occluded 
GLUT3 is nearly identical to that of the glucose-bound GLUT3. The second 
glucose unit (Glc2) of maltose completely overlaps with D-glucose, which is 
bound to the outward-occluded GLUT3. c, The bound maltose molecules are 




positioned similarly in the outward-open and outward-occluded GLUT3. The 
two structures of GLUT3 are superimposed relative to the C-terminal domain. 
The outward-open GLUT3 is domain coloured and the outward-occluded 
structure is coloured pale purple. d, Maltose coordination in the outward-open 
GLUT3. Note that the coordination of Glc2 by the outward-open GLUT3 is 
identical to that by the outward-occluded GLUT3, while the polar residues in 
the N-terminal domain are not involved in the coordination of Glc1 in the 
outward-open structure. 
Extended Data Figure 8 | The molecular basis underlying the rigidity and 
adaptability of the N-terminal and C-terminal domains, respectively. 
a, Structural feature of the N-terminal domain of GLUT3. The interior of the 
N-terminal domain is held through a strip of hydrogen bonds, which may 
provide the molecular basis for the rigidity of the N-terminal domain during the 
alternating access cycle. b, The C-terminal domain of GLUT3 has a 
hydrophobic core, which may allow the adaptability for the intra-domain shifts 
of the C-terminal domain during the transport cycle. c, Water distribution in 
the structure of glucose-bound GLUT3. The bound water molecules are 
shown as red spheres. Identical views are shown for the two panels except that 
the protein is omitted in the right panel to better illustrate the density of water 
molecules with respect to the N-terminal, C-terminal and ICH domains. 
Extended Data Table 1 | Statistics of data collection and refinement of GLUT3 bound to D-glucose 

Data                               GLUT3/D-glucose 
Integration package                HKL2000 
No. of crystals                    1 
Space group                        P2₁ 
Unit cell (Å)                      48.34, 118.13, 51.34 
Unit cell (°)                      90, 102.67, 90 
Wavelength (Å)                     1.0000 
Resolution (Å)                     40~1.50 (1.55~1.50) 
Rmerge (%)                         8.6 (57.8) 
I/sigma                            12.7 (3.6) 
Completeness (%)                   98.8 (91.8) 
Number of measured reflections     444,895 
Number of unique reflections       88,120 
Redundancy                         18.0 (2.0) 
Wilson B factor (Å²)               17.2 
Rwork / Rfree (%)                  17.53 / 19.42 
No. atoms 
  Protein                          3,638 
    main chain                     1,897 
    side chain                     1,741 
  Substrate                        12 
  Water                            101 
  Others                           75 
Average B value (Å²) 
  Protein                          25.08 
    main chain                     22.93 
    side chain                     27.43 
  Substrate                        18.91 
  Water                            40.50 
  Others                           48.40 
R.m.s. deviations 
  Bonds (Å)                        0.008 
  Angle (°)                        1.095 
Ramachandran plot statistics (%) 
  Most favorable                   97.0 
  Additionally allowed             3.0 
  Generously allowed               0.0 
  Disallowed                       0.0 
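
As a worked illustration of the cell parameters in Extended Data Table 1, the unit-cell volume of a monoclinic cell (α = γ = 90°, as for space group P2₁) follows from V = a·b·c·sin β. The sketch below is not part of the original analysis, and the function name is hypothetical.

```python
import math

def monoclinic_cell_volume(a, b, c, beta_deg):
    """Volume of a monoclinic unit cell, V = a * b * c * sin(beta), in cubic angstroms."""
    return a * b * c * math.sin(math.radians(beta_deg))

# Cell parameters of the glucose-bound GLUT3 crystal (Extended Data Table 1).
volume = monoclinic_cell_volume(48.34, 118.13, 51.34, 102.67)
print(f"unit-cell volume ~ {volume:.0f} cubic angstroms")
```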
Extended Data Table 2 | Statistics of data collection and refinement of GLUT3 bound to maltose 

Data                               Outward-occluded         Outward-open 
                                   GLUT3/maltose            GLUT3/maltose 
Integration package                HKL2000                  HKL2000 
No. of crystals                    7                        9 
Space group                        P2₁                      P2₁ 
Unit cell (Å)                      48.46, 119.49, 53.91     78.11, 121.88, 96.08 
Unit cell (°)                      90, 103.75, 90           90, 108.14, 90 
Wavelength (Å)                     1.0000                   1.0000 
Resolution (Å)                     40~2.40 (2.49~2.40)      40~2.60 (2.69~2.60) 
Rmerge (%)                         13.0 (35.1)              14.3 (51.8) 
I/sigma                            12.7 (3.6)               10.5 (2.6) 
Completeness (%)                   94.6 (91.4)              95.0 (97.3) 
Number of measured reflections     72,217                   175,711 
Number of unique reflections       22,422                   49,602 
Redundancy                         3.2 (2.8)                3.5 (3.6) 
Wilson B factor (Å²)               353                      84.6 
Rwork / Rfree (%)                  18.17 / 22.50            21.54 / 25.08 
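
The overall redundancy of a data set is approximately the number of measured reflections divided by the number of unique reflections. The following sketch applies that ratio to the two maltose-bound data sets; it is an illustrative consistency check under that assumption, with a hypothetical function name, and is not the procedure used by the data-processing software.

```python
def multiplicity(measured, unique):
    """Average number of observations per unique reflection."""
    return measured / unique

# (measured reflections, unique reflections, reported overall redundancy)
datasets = {
    "outward-occluded GLUT3/maltose": (72217, 22422, 3.2),
    "outward-open GLUT3/maltose": (175711, 49602, 3.5),
}

ratios = {}
for name, (measured, unique, reported) in datasets.items():
    ratios[name] = multiplicity(measured, unique)
    print(f"{name}: computed {ratios[name]:.2f}, reported {reported}")
```

For both maltose data sets the computed ratio agrees with the reported value to one decimal place.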
Structure and mechanism of the 
mammalian fructose transporter GLUT5 


Norimichi Nomura, Gregory Verdon, Hae Joo Kang, Tatsuro Shimamura, Yayoi Nomura, Yo Sonoda, 
Saba Abdul Hussien, Aziz Abdul Qureshi, Mathieu Coincon, Yumi Sato, Hitomi Abe, Yoshiko Nakada-Nakura, 
Tomoya Hino, Takatoshi Arakawa, Osamu Kusano-Arai, Hiroko Iwanari, Takeshi Murata, Takuya Kobayashi, 
Takao Hamakubo, Michihiro Kasahara, So Iwata & David Drew 


The altered activity of the fructose transporter GLUT5, an isoform of the facilitated-diffusion glucose transporter family, 
has been linked to disorders such as type 2 diabetes and obesity. GLUT5 is also overexpressed in certain tumour cells, and 
inhibitors are potential drugs for these conditions. Here we describe the crystal structures of GLUT5 from Rattus 
norvegicus and Bos taurus in open outward- and open inward-facing conformations, respectively. GLUT5 has a 
major facilitator superfamily fold like other homologous monosaccharide transporters. On the basis of a comparison 
of the inward-facing structures of GLUT5 and human GLUT1, a ubiquitous glucose transporter, we show that a single 
point mutation is enough to switch the substrate-binding preference of GLUT5 from fructose to glucose. A comparison of 
the substrate-free structures of GLUT5 with occluded substrate-bound structures of Escherichia coli XylE suggests that, 
in addition to global rocker-switch-like re-orientation of the bundles, local asymmetric rearrangements of 
carboxy-terminal transmembrane bundle helices TM7 and TM10 underlie a ‘gated-pore’ transport mechanism in 
such monosaccharide transporters. 


GLUT transporters belong to the solute carrier 2 family (SLC2) and, 
so far, 14 different isoforms (GLUT1-GLUT14) have been identified. 
Except for GLUT13, GLUT transporters are uniporters, 
which facilitate the diffusion of monosaccharides like glucose and 
fructose across the cell membrane in a concentration-dependent 
manner. Each GLUT transporter shows a distinct pattern of tissue 
distribution, gene regulation, substrate preference and kinetic 
properties. For example, GLUT1 is distributed in a wide range of tissues, 
including the blood-brain barrier, and is essential for glucose 
transport into the brain, whereas GLUT4 is mostly localized to skeletal 
muscles and adipose tissue, and is the major insulin-stimulated 
glucose transporter. GLUT5 is the only member specific to fructose 
and, together with GLUT2, which transports fructose in addition to 
glucose, makes up the major fructose transporters in the body. 
GLUT5 is primarily expressed in the small intestine, but lower levels 
are also expressed in brain, adipose tissue, kidney, testes and skeletal 
muscle. GLUT activity is also associated with various diseases. 
For instance, increased GLUT5 expression has been linked to several 
metabolic disorders and several types of cancers such as breast 
cancer, because of the higher energy demand of cancer cells 
stimulating sugar uptake, the so-called Warburg effect. GLUT 
transporters belong to the larger major facilitator superfamily (MFS), the 
members of which have a fold consisting of two symmetrical six 
transmembrane helix (TM) bundles. Within the MFS they belong 
to a subfamily of sugar porter transporters, whose members are 
found in all domains of life and are important targets for industrial 
and biomedical applications. Recently, an open inward-facing 
structure of human GLUT1 was reported with a bound sugar from 
a detergent head-group in the substrate-binding site and compared to 
previous structures of the related Escherichia coli D-xylose:H⁺ 
symporter XylE in the outward- and inward-occluded conformations, 
suggesting a ‘rocker-switch’-type transport mechanism. 
However, as little is known about the molecular basis of substrate 
binding and release in GLUT transporters, their alternating-access 
mechanism is yet to be fully understood. 


Open outward and inward GLUT5 structures 


Rat and bovine GLUT5 (rGLUT5 and bGLUT5, respectively) that 
share ~81% sequence identity to human GLUT5 were selected and 
optimized for structural studies using fluorescence-based screening 
methods (Methods). rGLUT5 was crystallized in complex with an Fv 
antibody fragment (rGLUT5-Fv; see Methods). The rGLUT5-Fv and 
bGLUT5 structures were solved by molecular replacement (MR) and 
refined against data extending up to 3.3 Å and 3.2/4.0 Å (anisotropic 
data), respectively (Extended Data Tables 1 and 2, Extended Data 
Fig. 1 and Methods). The GLUT5 structure shows the typical MFS 
fold, plus five additional helices on the intracellular side, one at the C 
terminus (ICH5) and the other four, ICH1-ICH4, located between 
the amino- and C-terminal TM bundles (Fig. 1). bGLUT5 crystallized 
in an open inward-facing conformation (Fig. 1), and although human 
GLUT1 (hGLUT1) and bGLUT5 share only 43% sequence identity, 
their inward-facing structures superimpose well, with a root mean 
squared deviation (r.m.s.d.) of 1.12 Å for 364 pairs of Cα atoms 
(Methods and Extended Data Fig. 2a). The rGLUT5-Fv structure 
Figure 1 | Structures of rat GLUT5 in the open outward-facing 
conformation and bovine GLUT5 in the open inward-facing conformation. 
a, Ribbon representation of open outward-facing rat GLUT5 (left) and open 
inward-facing bovine GLUT5 (right) structures, viewed in the plane of the 
membrane. TM1 and TM4, and TM2, TM3, TM5 and TM6 in the N-terminal 
TM bundle are coloured in blue and light-blue, respectively. TM7 and TM10, 
and TM8, TM9, TM11 and TM12 in the C-terminal TM bundle are coloured in 
red and yellow-brown, respectively. The intracellular domain helices ICH1 to 
ICH5 are shown in grey. b, Slab through the surface electrostatic potential 
of the open outward-facing (left) and open inward-facing (right) GLUT5 
structures, as viewed within the plane of membrane, which highlight the 
accessibility of the sugar to the central cavity (shown as a dotted ellipse). 
c, Ribbon diagrams of GLUT5 viewed from the cytoplasm in the open 
outward-facing (left) and inward-facing (right) conformations. 


shows an open outward-facing conformation, a state that has
not been observed previously in any of the related sugar porter
structures (Fig. 1). The open outward-facing conformation is possibly
stabilized by the Fv fragment, which binds to the ICHs (Extended
Data Fig. 3).
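As a reminder of what an r.m.s.d. value such as the one quoted for the hGLUT1 and bGLUT5 comparison measures, the following minimal sketch computes the root mean squared deviation over paired atom positions that are assumed to be superimposed already. The function name `rmsd` and the coordinates are hypothetical and chosen only for illustration; the sketch does not reproduce the superposition procedure described in Methods.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean squared deviation between two equally long lists of
    (x, y, z) positions that are already superimposed (units: angstrom)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared displacement of one atom pair.
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical, already-aligned C-alpha positions (not taken from any PDB entry).
model_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
model_b = [(0.0, 1.0, 0.0), (3.8, 0.0, 1.0), (7.6, 1.0, 0.0)]

print(f"r.m.s.d. = {rmsd(model_a, model_b):.2f} angstrom over {len(model_a)} pairs")
```

In practice the two structures would first be brought into a common frame by an optimal superposition, and only the atom pairs retained by the alignment would enter the sum.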


Central fructose-binding site of GLUT5 

The GLUT5 substrate-binding site is closely related to those of
hGLUT1 and E. coli XylE (refs 21, 22) (Fig. 2a and Extended Data
Fig. 2b). Many of the residues lining the central cavity are conserved
between GLUT5 and hGLUT1, and include Ile169, Ile173, Gln166,
Gln287, Gln288, Asn324 and Trp419 (Fig. 2a and Extended Data
Fig. 4). In GLUT5, Trp419 is the only tryptophan positioned in the
substrate-binding site (Fig. 2a and Extended Data Fig. 5a), and it is
essential for transport. Consistent with rGLUT5 transport activity
(Extended Data Fig. 6a), strong quenching of tryptophan fluorescence
Figure 2 | The fructose-binding site of GLUT5. a, The substrate-binding site
in the inward-facing bGLUT5 structure (left panel; coloured as in Fig. 1) is very
similar to the inward-facing hGLUT1 structure (right panel; light grey). To
facilitate comparison to rGLUT5, bGLUT5 residues are labelled with rGLUT5
numbering. For hGLUT1, only Q161 and all other residues that are different in
bGLUT5 are labelled. The D-glucopyranoside moiety of bound n-nonyl-β-D-
glucopyranoside in hGLUT1 is shown in stick representation. b, D-fructose
binding to GLUT5 as measured by tryptophan (Trp) fluorescence quenching
(excitation 295 nm; emission 338 nm) after addition of increasing
concentrations of D-fructose to wild-type (WT; black squares) and to wild-type
protein that had been previously incubated with the GLUT inhibitor HgCl2
(open circles). c, Trp fluorescence quenching for purified substrate-binding site
mutants after addition of 40 mM D-fructose (open bars) relative to wild type
(filled bar). d, Trp fluorescence quenching after addition of either 40 mM
D-fructose or D-glucose to purified wild type (filled bars) or Q166E (open bars);
pre-incubation with the inhibitor HgCl2 is indicated. e, Trp fluorescence
quenching after addition of increasing concentrations of D-glucose to either
purified Q166E (filled squares), Q166E previously incubated with HgCl2
(open circles) or wild type (open triangles). In all experiments error bars
indicate s.e.m.; n = 3.


could be observed with the addition of D-fructose, but not with the
addition of L-fructose or known GLUT1 substrates such as D-glucose,
D-galactose or D-mannose (Extended Data Fig. 5). Using this assay,
the dissociation constant (Kd) of rGLUT5 for D-fructose was measured
to be around 6-9 mM (Fig. 2b and Extended Data
Fig. 6b), which is similar to that reported for human GLUT5 (refs 5, 8).
Tryptophan fluorescence quenching of rGLUT5 was used throughout
this study to assess substrate binding.
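To illustrate how a dissociation constant can be extracted from a quenching titration of this kind, the sketch below fits a single-site binding curve, quenching = amplitude × [S] / (Kd + [S]), to a set of concentrations and signals. The single-site model, the function names (`one_site`, `fit_kd`) and the search ranges are assumptions made for this example, and the titration data are synthetic; the fitting procedure actually used for the measurements is not specified in this section.

```python
import math


def one_site(conc, kd, amp):
    """Quenching predicted by a single-site binding model (assumed form)."""
    return amp * conc / (kd + conc)


def _profile(kd, conc, signal):
    """For a fixed Kd, return the least-squares amplitude and the residual
    sum of squares (the amplitude enters linearly, so it has a closed form)."""
    frac = [c / (kd + c) for c in conc]
    amp = sum(y * f for y, f in zip(signal, frac)) / sum(f * f for f in frac)
    sse = sum((y - amp * f) ** 2 for y, f in zip(signal, frac))
    return amp, sse


def fit_kd(conc, signal, log_lo=-2.0, log_hi=3.0, grid=101, iters=100):
    """Estimate (Kd, amplitude) from positive concentrations and signals.

    A coarse scan over log10(Kd) locates the best grid point; a
    golden-section search then refines Kd between its two neighbours."""
    def sse_at(log_kd):
        return _profile(10.0 ** log_kd, conc, signal)[1]

    step = (log_hi - log_lo) / (grid - 1)
    logs = [log_lo + i * step for i in range(grid)]
    best = min(range(grid), key=lambda i: sse_at(logs[i]))
    a = logs[max(best - 1, 0)]
    b = logs[min(best + 1, grid - 1)]

    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(iters):
        c = b - ratio * (b - a)
        d = a + ratio * (b - a)
        if sse_at(c) < sse_at(d):
            b = d  # minimum lies in [a, d]
        else:
            a = c  # minimum lies in [c, b]
    kd = 10.0 ** ((a + b) / 2.0)
    return kd, _profile(kd, conc, signal)[0]


# Synthetic, noise-free titration generated with Kd = 7.5 mM and 40% maximal
# quenching (illustrative values only, not measurements from this study).
sugar_mM = [1.0, 2.5, 5.0, 10.0, 20.0, 40.0, 80.0]
quench = [one_site(c, kd=7.5, amp=40.0) for c in sugar_mM]

kd_fit, amp_fit = fit_kd(sugar_mM, quench)
print(f"Kd ~ {kd_fit:.2f} mM, maximal quenching ~ {amp_fit:.1f}%")
```

With millimolar affinities such as those reported here, the titration has to extend well above the Kd for the plateau, and therefore the amplitude, to be constrained by the data.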

Gln166 is the only conserved residue in the substrate-binding site
that is positioned differently between the hGLUT1 and GLUT5
structures (Fig. 2a). In hGLUT1 the equivalent glutamine (Gln161) points
away from, rather than towards, the central cavity, as it has probably
Figure 3 | Inter-TM salt bridges form between bundle cytoplasmic ends in
the outward-facing conformation. a, Cartoon representation of GLUT5 as
viewed from the cytoplasm in the outward-facing (left) and inward-facing
(right) conformations. ICHs are not shown for clarity. TMs are coloured as in
Fig. 1a, and residues forming salt bridges are shown as sticks. To facilitate
comparison to rGLUT5, bGLUT5 residues are labelled with rGLUT5
numbering. Breakage of conserved salt bridges as seen here for GLUT5 has also
been predicted for XylE. b, Superimposition of the N- and C-terminal 6-TM
bundles. Strictly conserved and pseudo-symmetry-related charged residues
forming the salt bridges are labelled and are shown as sticks. c, D-fructose
binding as measured by Trp fluorescence quenching after incubation with
40 mM D-fructose for purified wild-type GLUT5 (filled bar) and single alanine
mutations of key acidic residues E400 and E151 that form inter-bundle salt
bridges, and G389, which is located in the hinge point of TM10 critical for
TM10 conformational change (open bars). Trp fluorescence quenching by
D-fructose for these mutants is displayed as a percentage of wild-type binding.
In all experiments error bars indicate s.e.m.; n = 3.


re-orientated to accommodate the acyl chain of the bound detergent
in the sugar-binding site (Fig. 2a). In XylE the equivalent glutamine
(Gln168) is orientated similarly to that in GLUT5 (Extended Data Fig. 2b).
Other residues that line the binding site, but are not conserved
between GLUT5 and hGLUT1, are Tyr31 (Phe26 in hGLUT1),
His386 (Phe379), Ala395 (Trp388), His418 (Asn411) and Ser391
(Gly384) (Fig. 2a and Extended Data Fig. 4). Single alanine mutants
of each of these residues show strongly reduced D-fructose binding
(Fig. 2c). The alanine mutants that show the weakest D-fructose
binding are Tyr31 (TM1), Gln287 (TM7), His386 (TM10) and His418
(TM11). Except for Tyr31, these residues belong to the C-terminal
TM bundle, indicating an asymmetrical binding mode of the sugar
between the N- and C-terminal TM bundles in GLUT5, as seen in the
sugar-bound hGLUT1 and XylE structures. The substrate-binding
cavity is deeper in GLUT5, because the tryptophan residue found at
the bottom of the cavity in hGLUT1 (Trp388) and XylE (Trp392) is
replaced with alanine (Ala395) in GLUT5 (Fig. 2a and Extended Data
Fig. 6c). Trp388 in hGLUT1 is critical for inhibition by cytochalasin
B, whereas GLUT5 is insensitive to this inhibitor.

GLUT7 is the closest isoform to GLUT5 and transports both
D-fructose and D-glucose. Among the substrate-binding-site
residues of GLUT5, almost all are identical to those in GLUT7
(Extended Data Fig. 4). The most notable difference is that Gln166
in GLUT5 is replaced in GLUT7 with glutamate (Extended Data
Fig. 4). In rGLUT5, the substitution of Gln166 (TM5) with glutamate
clearly weakens D-fructose binding (Fig. 2c, d). Furthermore, the
Q166E mutant now shows robust binding to D-glucose with an
apparent affinity (Kd) of ~4 mM (Fig. 2d, e). The introduced carboxylate is
Figure 4 | Substrate-induced gates are predominantly formed by TM7 and
TM10 in the C-terminal bundle. a, Superposition of GLUT5 open outward-
and inward-facing (*) structures, as viewed from the extracellular (left) and
intracellular (right) side of the membrane. TMs are coloured as in Fig. 1a,
except inward-facing helices TM1*, TM4* shown in orange and TM7*, TM10*
shown in cyan. ICHs have been removed for clarity. b, Superimposition of the
GLUT5 open outward- and inward-facing structures as viewed in the plane of
the membrane. For clarity, TM5, TM5*, TM8 and TM8* are not shown.
Cavity-closing contacts are mostly formed by TM1* and TM7* on the
extracellular side in the inward-facing conformation and by TM4 and TM10 on
the intracellular side in the outward-facing conformation. These TMs are the
first TMs in each of the four 3-TM repeats of the MFS fold. D-xylose, as it is
in the occluded-outward-facing XylE structure (PDB 4GBY), is shown in stick
representation. With the inward movement of TM7, conserved tyrosine
residues are likely to occlude the substrate from exiting, as seen for the
equivalently located tyrosine residues in the substrate-occluded XylE
structure and as supported by D-fructose binding data (Extended Data Fig.
6d). The opening movement of TM10 to enable cytosolic substrate release has
been described previously for XylE and other unrelated MFS
transporters. c, Interactions between hydrophobic residues between TM7
and TM10 in the outward-facing conformation (left) are lost in the inward-
facing conformation (right). To facilitate comparison to rGLUT5, bGLUT5
residues are labelled with rGLUT5 numbering.


likely to have a similar role to Glu380 (TM10) in hGLUT1 (Fig. 2a),
which is essential for D-glucose transport.


Salt bridges stabilize the outward conformation 


A common feature observed in recent structures of MFS transporters
is the presence of inter-TM bundle salt bridges that break and form
near the central cavity during transport. In H+-coupled transporters, it is
thought that the protonation state of these salt bridges is coupled to
substrate binding and further to structural transitions. In
GLUT5, no salt bridges are observed near the central cavity in either
conformation. Instead, inter-TM bundle salt bridges are observed
only in the outward-facing conformation and far from the central
cavity, linking the cytoplasmic ends of TM3, TM4 and TM5 in the
N-terminal TM bundle to those of TM9, TM10 and TM11 in the
C-terminal TM bundle (Fig. 3a). Specifically, Glu151 (TM4) forms
a salt bridge to both Arg97 (TM3) and Arg407 (TM11), and similarly
Glu400 (TM10) forms a salt bridge to both Arg158 (TM5) and Arg340
Figure 5 | Alternating-access transport mechanism in GLUT5. Schematic
representation of the rocker-switch-type movement of the N- and C-terminal 
TM bundles and of the local, gating conformational changes of TM7 and 
TM10 supporting a gated-pore-type transport mechanism in GLUT5. 


(TM9). Owing to their strict conservation these residues make up the
well-described sugar porter signatures (Extended Data Fig. 4), and
are related by a pseudo-two-fold symmetry axis running through the
centre of the transporter and perpendicular to the membrane plane
(Figs 1a and 3b). In the inward-facing conformation the inter-TM
bundle salt-bridge pairs Glu400-Arg158 and Glu151-Arg407 are
separated by ~7 Å and 13 Å, respectively (Fig. 3a). In the
inward-facing conformation, no inter-TM bundle salt bridges are
formed on the extracellular side, indicating that in the absence of
substrate GLUT proteins may favour the outward-facing
conformation. Consistently, the substitution of Glu400, Arg407 and Glu336
equivalent residues in GLUT4 to neutral amino acids arrests the
transporter in an inward-facing conformation. Moreover, in the
outward-facing conformation Glu336 (TM8) in the C-terminal TM
bundle forms a salt bridge to Arg340, which is connected to the
inter-TM bundle Glu400-Arg158 salt bridge (Fig. 3a). The equivalent
glutamate to Glu336 in hGLUT1 (Glu329) was substituted with glutamine
to stabilize the inward-facing hGLUT1 structure. The salt bridges
formed between inter-TM bundles seem to be coupled also to
substrate binding in GLUT5, because substituting Glu151 and Glu400
with alanine significantly reduces D-fructose binding (Fig. 3c).
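The distinction drawn above between formed and broken salt bridges rests on the distance between oppositely charged side-chain groups. The sketch below shows one simple geometric test: the shortest distance between the carboxylate oxygens of an acidic residue and the guanidinium nitrogens of an arginine is compared with a cutoff. The 4 Å cutoff is a commonly used convention and is an assumption here, not a value taken from this paper; the function names and all coordinates are hypothetical.

```python
import math
from itertools import product

# Assumed distance criterion (angstrom); a common convention, not from the paper.
SALT_BRIDGE_CUTOFF = 4.0


def min_distance(atoms_a, atoms_b):
    """Shortest distance between any atom in atoms_a and any atom in atoms_b.
    Uses math.dist, which requires Python 3.8 or later."""
    return min(math.dist(p, q) for p, q in product(atoms_a, atoms_b))


def is_salt_bridge(acid_oxygens, base_nitrogens, cutoff=SALT_BRIDGE_CUTOFF):
    """True if the closest O...N pair lies within the cutoff."""
    return min_distance(acid_oxygens, base_nitrogens) <= cutoff


# Hypothetical coordinates for one Glu carboxylate and one Arg guanidinium
# in two conformations (invented numbers, not taken from 4YBQ or 4YB9).
glu_oxygens = [(0.0, 0.0, 0.0), (2.2, 0.0, 0.0)]
arg_nitrogens_outward = [(0.0, 2.9, 0.0), (2.2, 3.1, 0.0)]
arg_nitrogens_inward = [(0.0, 7.0, 0.0), (2.2, 7.4, 0.0)]

for label, nitrogens in (("outward-facing", arg_nitrogens_outward),
                         ("inward-facing", arg_nitrogens_inward)):
    d = min_distance(glu_oxygens, nitrogens)
    print(f"{label}: closest O-N distance {d:.1f} angstrom, "
          f"salt bridge: {is_salt_bridge(glu_oxygens, nitrogens)}")
```

Under this criterion, separations of several ångströms beyond the cutoff, such as those reported for the inward-facing conformation, would be classified as broken salt bridges.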

The ICH domain is below this cytoplasmic salt-bridge network,
and positioned similarly, with respect to the N- and C-terminal TM
bundles, in both outward-open GLUT5 and outward-occluded XylE
structures (Extended Data Fig. 7a). ICH1, ICH2 and ICH3 are linked
together by salt bridges, as previously shown for XylE and GLUT1
(refs 21, 22). In GLUT5, ICH5 lacks charged residues that could
interact with ICH1-ICH3 (Extended Data Fig. 7b), as it does in the
outward-occluded XylE structure. Rather, the N-terminal helices
ICH1-ICH3 interact with the C-terminal bundle via a salt bridge
formed between Glu252 in ICH3 and Arg407 in TM11, thus linking
the ICH domain to a TM involved in the inter-bundle salt-bridge
network. In inward-facing GLUT5, these interactions are broken
(Extended Data Fig. 7c) and, as observed in the inward-facing
hGLUT1 and XylE structures, ICH5 could not be built
(Extended Data Fig. 2a). Therefore, the role of the ICH domain might
be to provide additional stabilization of the outward-facing
conformation, as suggested previously.


TM7 and TM10 form substrate-induced gates 


In GLUT5, the N- and C-terminal TM bundles undergo a small
rotation of ~15° between the open outward- and inward-facing
conformations (Fig. 1). As observed in other MFS transporter
structures, cavity-closing contacts in GLUT5 form mostly between
TM1 and TM7 on the outside, and between TM4 and TM10 on the
inside (Figs 1c and 4a). Among these helices, however, TM7 and
TM10 in the C-terminal TM bundle seem to have the most prominent
roles in occluding the substrate-binding site from the outside and
inside, respectively (Fig. 4b). Comparisons of the substrate-free open
GLUT5 structures with the substrate-occluded XylE structures
highlight the central role of TM7 and TM10 in gating (Extended
Data Fig. 8). Inverted-symmetry-related TM7 and TM10 make up
highly conserved sugar transporter signatures, and they also form a
large fraction of the substrate-binding site in hGLUT1 (ref. 21) and
XylE (refs 21, 22), in agreement with previous functional data.
Between the outward-open GLUT5 conformation and
outward-occluded XylE conformation, the extracellular half of TM7 shows
the largest shift towards the substrate-binding site (Extended Data
Fig. 8a). Similarly, from the inward-occluded XylE conformation
to the inward-open GLUT5 conformation, the intracellular half of
TM10 shows the largest shift away from the substrate-binding site
(Extended Data Fig. 8b, c); this TM10 movement has also been
described previously for XylE. In GLUT5 the observed
conformational changes in TM7 and TM10 occur at hinge points that contain
glycine residues (Fig. 4b, c and Extended Data Fig. 4). Tyr382 in TM10
interacts tightly with TM7 residues Ile295 and Val292 in the
outward-facing state and these interactions do not take place in the
inward-facing conformation (Fig. 4c). The interactions between TM7 and
TM10 seem important to coordinate their conformational changes
during transport, because mutation of the Ile295 equivalent residue in
hGLUT5 to valine or alanine abolishes D-fructose transport.
Consistently, similar interface-perturbing mutations of Ile295
significantly reduce D-fructose binding in rGLUT5 (Extended Data Fig. 6d).
Furthermore, in hGLUT1 the equivalent residue to Val292 (Ile287)
was substituted with every other amino acid, and each mutant shows
markedly different D-glucose affinities and transport kinetics.
This suggests that the interactions between TM7 and TM10
are important and tuned with respect to substrate affinity and
transport kinetics.
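A rigid-body rotation such as the ~15° quoted at the start of this section can be read off the rotation matrix that relates the two positions of a bundle, using the relation angle = arccos((trace(R) − 1) / 2). The sketch below illustrates only this relation; the helper names are hypothetical, the test matrix is constructed rather than derived from the GLUT5 coordinates, and how the rotation was measured for the structures is not described in this section.

```python
import math


def rotation_angle_deg(rot):
    """Rotation angle (degrees) of a 3x3 rotation matrix given as nested lists,
    from the identity trace(R) = 1 + 2*cos(angle)."""
    trace = rot[0][0] + rot[1][1] + rot[2][2]
    # Clamp to [-1, 1] to guard against rounding error before acos.
    cos_theta = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_theta))


def rot_z(angle_deg):
    """Rotation matrix for a rotation about the z axis (illustrative helper)."""
    t = math.radians(angle_deg)
    return [[math.cos(t), -math.sin(t), 0.0],
            [math.sin(t), math.cos(t), 0.0],
            [0.0, 0.0, 1.0]]


# A constructed 15-degree rotation, standing in for a matrix that would
# normally come from superimposing one bundle onto its other conformation.
print(f"recovered angle: {rotation_angle_deg(rot_z(15.0)):.1f} degrees")
```

The trace formula returns the magnitude of the rotation only; the axis, if needed, has to be extracted separately from the antisymmetric part of the matrix.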


Conclusions 


Symmetrical substrate binding and rigid-body movements of the
N- and C-terminal TM bundles around a centrally located
substrate-binding site form the structural basis of the ‘rocker-switch’
mechanism in MFS transporters. However, here we have
described asymmetrical rearrangements in the MFS subfamily of
sugar porters, consistent with the asymmetrical binding mode of
the sugar reported previously in XylE and hGLUT1 structures.
Therefore, we conclude that transport in GLUT5 is controlled not
only by the rocker-switch-type movement of the N- and
C-terminal TM bundles, but also by a gated-pore mechanism, in
the form of local, gating movements by TM7 and TM10 in the
C-terminal TM bundle that are coupled to substrate binding and
release (Fig. 5 and Supplementary Videos 1, 2 and 3). TM10 is also
part of the inter-TM bundle salt-bridge network, indicating how local
gating and global rocker-switch-type movements are coupled.
Importantly, a deeper understanding of GLUT5 structure and
transport mechanism, as described here, should facilitate the
structure-based design of novel inhibitors with therapeutic potential.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 19 February; accepted 14 July 2015. 
Published online 30 September 2015. 


1. Mueckler, M. & Thorens, B. The SLC2 (GLUT) family of membrane transporters. 
Mol. Aspects Med. 34, 121-138 (2013). 

2. Zhao, F. Q. & Keating, A. F. Functional properties and genomics of glucose 
transporters. Curr. Genomics 8, 113-128 (2007). 

3. Simpson, I. A., Vannucci, S. J. & Maher, F. Glucose transporters in mammalian
brain. Biochem. Soc. Trans. 22, 671-675 (1994).

4. James, D. E., Strube, M. & Mueckler, M. Molecular cloning and characterization of 
an insulin-regulatable glucose transporter. Nature 338, 83-87 (1989). 

5. Burant, C. F., Takeda, J., Brot-Laroche, E., Bell, G. I. & Davidson, N. O. Fructose
transporter in human spermatozoa and small intestine is GLUT5. J. Biol. Chem. 
267, 14523-14526 (1992). 

6. Kayano, T. et al. Human facilitative glucose transporters. Isolation, functional 
characterization, and gene localization of cDNAs encoding an isoform (GLUT5) 
expressed in small intestine, kidney, muscle, and adipose tissue and an unusual 
glucose transporter pseudogene-like sequence (GLUT6). J. Biol. Chem. 265, 
13276-13282 (1990). 

7. Blakemore, S. J. et al. The GLUT5 hexose transporter is also localized to the 
basolateral membrane of the human jejunum. Biochem. J. 309, 7-12 (1995). 

8. Douard, V. & Ferraris, R. P. Regulation of the fructose transporter GLUT5 in health 
and disease. Am. J. Physiol. Endocrinol. Metab. 295, E227-E237 (2008). 

9. Rand, E. B., Depaoli, A. M., Davidson, N. O., Bell, G. I. & Burant, C. F. Sequence, tissue
distribution, and functional characterization of the rat fructose transporter GLUT5.
Am. J. Physiol. 264, G1169-G1176 (1993).

10. Shepherd, P. R., Gibbs, E. M., Wesslau, C., Gould, G. W. & Kahn, B. B. Human small
intestine facilitative fructose/glucose transporter (GLUT5) is also present in
insulin-responsive tissues and brain. Investigation of biochemical characteristics
and translocation. Diabetes 41, 1360-1365 (1992).

11. Zisman, A. et al. Targeted disruption of the glucose transporter 4 selectively in
muscle causes insulin resistance and glucose intolerance. Nature Med. 6,
924-928 (2000).

12. Barone, S. et al. Slc2a5 (Glut5) is essential for the absorption of fructose in the
intestine and generation of fructose-induced hypertension. J. Biol. Chem. 284,
5056-5066 (2009).

13. Douard, V. & Ferraris, R. P. The role of fructose transporters in diseases linked to
excessive fructose intake. J. Physiol. 591, 401-414 (2013).

14. Zamora-Leon, S. P. et al. Expression of the fructose transporter GLUT5 in human
breast cancer. Proc. Natl Acad. Sci. USA 93, 1847-1852 (1996).

15. Warburg, O. On respiratory impairment in cancer cells. Science 124, 269-270
(1956).

16. Yan, N. Structural advances for the major facilitator superfamily (MFS)
transporters. Trends Biochem. Sci. 38, 151-159 (2013).

17. Madej, M. G., Sun, L., Yan, N. & Kaback, H. R. Functional architecture of MFS
D-glucose transporters. Proc. Natl Acad. Sci. USA 111, E719-E727 (2014).

18. Maiden, M. C., Davis, E. O., Baldwin, S. A., Moore, D. C. & Henderson, P. J.
Mammalian and bacterial sugar transport proteins are homologous. Nature 325,
641-643 (1987).

19. Pao, S. S., Paulsen, I. T. & Saier, M. H. Jr. Major facilitator superfamily. Microbiol. Mol.
Biol. Rev. 62, 1-34 (1998).

20. Farwick, A., Bruder, S., Schadeweg, V., Oreb, M. & Boles, E. Engineering of yeast
hexose transporters to transport D-xylose without inhibition by D-glucose. Proc.
Natl Acad. Sci. USA 111, 5159-5164 (2014).

21. Deng, D. et al. Crystal structure of the human glucose transporter GLUT1. Nature 
510, 121-125 (2014). 

22. Sun, L. et al. Crystal structure of a bacterial homologue of glucose transporters 
GLUT1-4. Nature 490, 361-366 (2012). 

23. Quistgaard, E. M., Low, C., Moberg, P., Tresaugues, L. & Nordlund, P. Structural 
basis for substrate transport in the GLUT-homology family of monosaccharide 
transporters. Nature Struct. Mol. Biol. 20, 766-768 (2013). 

24. Wisedchaisri, G., Park, M. S., Iadanza, M. G., Zheng, H. & Gonen, T. Proton-coupled
sugar transport in the prototypical major facilitator superfamily protein XylE. Nature
Commun. 5, 4521 (2014).

25. Iancu, C. V., Zamoon, J., Woo, S. B., Aleshin, A. & Choe, J. Y. Crystal structure of a
glucose/H+ symporter and its mechanism of action. Proc. Natl Acad. Sci. USA 110,
17862-17867 (2013).

26. Garcia, J. C., Strube, M., Leingang, K., Keller, K. & Mueckler, M. M. Amino acid 
substitutions at tryptophan 388 and tryptophan 412 of the HepG2 (Glut1) glucose 
transporter inhibit transport activity and targeting to the plasma membrane in 
Xenopus oocytes. J. Biol. Chem. 267, 7770-7776 (1992). 


27. Li, Q. et al. Cloning and functional characterization of the human GLUT7 isoform 
SLC2A7 from the small intestine. Am. J. Physiol. Gastrointest. Liver Physiol. 287, 
G236-G242 (2004). 

28. Mueckler, M. & Makepeace, C. Analysis of transmembrane segment 10 of the Glut1 
glucose transporter by cysteine-scanning mutagenesis and substituted cysteine 
accessibility. J. Biol. Chem. 277, 3498-3503 (2002).

29. Andersson, M. et al. Proton-coupled dynamics in lactose permease. Structure 20, 
1893-1904 (2012). 

30. Law, C. J. et al. Salt-bridge dynamics control substrate-induced conformational
change in the membrane transporter GlpT. J. Mol. Biol. 378, 828-839 (2008).

31. Schürmann, A. et al. Role of conserved arginine and glutamate residues on the
cytosolic surface of glucose transporters for transporter function. Biochemistry 36,
12897-12902 (1997).

32. Seatter, M. J., De la Rue, S. A., Porter, L. M. & Gould, G. W. QLS motif in 
transmembrane helix VII of the glucose transporter family interacts with the C-1 
position of D-glucose and is involved in substrate selection at the exofacial binding 
site. Biochemistry 37, 1322-1326 (1998). 

33. Hruz, P. W. & Mueckler, M. M. Cysteine-scanning mutagenesis of transmembrane
segment 7 of the GLUT1 glucose transporter. J. Biol. Chem. 274, 36176-36180
(1999).

34. Manolescu, A., Salas-Burgos, A. M., Fischbarg, J. & Cheeseman, C. I. Identification of
a hydrophobic residue as a key determinant of fructose transport by the facilitative
hexose transporter SLC2A7 (GLUT7). J. Biol. Chem. 280, 42978-42983 (2005).

35. Kasahara, T., Maeda, M., Boles, E. & Kasahara, M. Identification of a key residue 
determining substrate affinity in the human glucose transporter GLUT1. Biochim. 
Biophys. Acta 1788, 1051-1055 (2009). 

36. Karpowich, N. K. & Wang, D. N. Structural biology. Symmetric transporters for 
asymmetric transport. Science 321, 781-782 (2008). 

37. Radestock, S. & Forrest, L. R. The alternating-access mechanism of MFS 
transporters arises from inverted-topology repeats. J. Mol. Biol. 407, 698-715 
(2011). 

38. Solcan, N. et al. Alternating access mechanism in the POT family of oligopeptide 
transporters. EMBO J. 31, 3411-3421 (2012). 

39. Fukuda, M. et al. Structural basis for dynamic mechanism of nitrate/nitrite antiport
by NarK. Nature Commun. 6, 7097 (2015).


Supplementary Information is available in the online version of the paper. 


Acknowledgements We are grateful to D. Slotboom, A. Cameron and S. Newstead for 
discussions and comments, and J. Mansfield for assistance with large-scale yeast 
fermentations and H. Unno with rGLUT5 crystallization. Data were collected at the 
European Synchrotron Radiation Facility, Diamond Light Source, and SPring-8 
(proposal numbers 2011A1393, 2011B1229, 2012A1184, 2012B1253, 
2013A1241, 2013B1237, 2014A1348 and 2014B1407), with assistance from 
beamline scientists. This work was funded by the Knut and Alice Wallenberg 
Foundation (D.D.), The Royal Society through the University Research Fellow scheme
(D.D.), the BBSRC (BB/G02325/1 to S.I.), the ERATO Human Receptor Crystallography
Project of the Japan Science and Technology Agency (JST) (S.I.), by the Research
Acceleration Program of the JST (S.I.), by the Targeted Proteins Research Program of
the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan
(S.I.), and by Grants-in-Aid for Scientific Research from the MEXT (No. 22570114 to
N.N.), and by the Platform for Drug Discovery, Informatics, and Structural Life Science 
from the MEXT (T.K.). The authors are grateful for the use of the Membrane Protein 
Laboratory funded by the Wellcome Trust (grant 062164/Z/00/Z) at the Diamond 
Light Source Limited, and The Centre for Biomembrane Research (CBR), supported by 
the Swedish Foundation for Strategic Research. H.J.K. was a recipient of a Human 
Frontiers Postdoctoral fellowship and D.D. acknowledges support from EMBO through 
the Young Investigator Program (YIP). 


Author Contributions N.N., S.I. and D.D. designed the project. Cloning, expression
screening and initial crystallization of rat and bovine GLUT5 were carried out by H.J.K.,
Y.So. and D.D. Crystal optimization of bovine GLUT5 was carried out by H.J.K. and G.V.
Data collection, structure determination and refinement of bovine GLUT5 was carried 
out by G.V. Generation of rat GLUT5 scFv fragment was carried out by N.N., Y.N., T.M., 
Y.N.-N., O.K.-A., H.I., T.A., T.K. and T.Ha. Expression and purification of the Fv fragment 
was carried out by N.N., Y.N., Y.Sa., H.A. and T.Hi. Co-crystallization of rat GLUT5-Fv 
complex and data collection was performed by N.N. and Y.N. with assistance from T.Hi. 
and S.I. Structure determination and refinement of rat GLUT5-Fv was carried out by 
T.S. Experiments for functional analysis were designed by M.K. and D.D. and carried out 
by M.K., D.D., S.A.H. and A.A.Q. Modelling of GLUT5 was carried out by M.C. The 
manuscript was prepared by N.N., H.J.K., G.V., S.I. and D.D. All authors discussed the 
results and commented on the manuscript. 


Author Information The coordinates and the structure factors for bovine and rat 
GLUT5 have been deposited in the Protein Data Bank under accessions 4YB9 and 
4YBQ, respectively. Reprints and permissions information is available at
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. 
Correspondence and requests for materials should be addressed to N.N. 
(nnomura@mfour.med.kyoto-u.ac.jp), S.I. (s.iwata@mfour.med.kyoto-u.ac.jp) or D.D.
(ddrew@dbb.su.se). 


15 OCTOBER 2015 | VOL 526 | NATURE | 401 



METHODS 


No statistical methods were used to predetermine sample size. 

Rattus norvegicus GLUT5 cloned full-length sequence (UniProt accession
number P43427); N50Y deglycosylation mutation is underlined and additional 
C-terminal residues retained after TEV cleavage are shown in italic (see next 
section for cloning details): MEKEDQEKTGKLTLVLALATFLAAFGSSFQ 
YGYNVAAVNSPSEFMQQFYYDTYYDRNKENIESFTLTLLWSLTVSMFPFG 
GFIGSLMVGFLVNNLGRKGALLFNNIFSILPAILMGCSKIAKSFEINASRLLV 
GICAGISSNVVPMYLGELAPKNLRGALGVVPQLFITVGILVAQLFGLRSVL 
ASEEGWPILLGLTGVPAGLQLLLLPFFPESPRYLLIQKKNESAAEKALQTLR 
GWKDVDMEMEEIRKEDEAEKAAGFISVWKLFRMQSLRWQLISTIVLMT 
GQQLSGVNATYYYADQITYLSAGVKSNDVQYVTAGTGAVNVFMTMVTV 
FVVELWGRRNLLLIGFSTCLTACIVLTVALALQNTISWMPYVSIVCVIVY 
VIGHAVGPSPIPALFITEIFLQSSRPSAYMIGGSVHWLSNFIVGLIFPFIQVG 
LGPYSFIIFAIICLLTSIYIFMV VPETKGRTFVEINQIFAKKNKVSDVYPEKE 
EKELNDLPPATREQENLYFQ. 

Bos taurus GLUT5 cloned sequence containing 1-473 out of 501 residues
(UniProt accession number P58353); N51A deglycosylation mutation is under- 
lined and additional C-terminal residues retained after TEV cleavage are shown 
in italic (see next section for cloning details): MEPQDPVKREGRLTPVIVLAT 
LIAAFGSSFQYGYNVAAINSPSEFMKDFYAYTYYDRVGEYMNEFYLTLLW 
SVTVSMFPFGGFLGSLMVGPLVNNLGRKGTLLFENNIFSIVPALLMGFSELA 
KSFEMITVARVLVGICAGLSSNV VPMYLGELAPKNWRGALGVVPQLFITI 
GILVAQIFGLRSLLANEEGWPILLGLTGIPAVLQLLFLPFFPESPRYLLIQKK 
DEAAAKSALRRLRGWHDVDAEIEEILEEDRAEKAVGFISVLKLFKMRSLR 
WOQVISITVLMAGQQLSGVNAITYYYADQIYLSAGVNEDDVQYVTAGTGAV 
NVLITVCAIFV VELMGRRFLLLLGFSVCFTACCVLTGALALQDVISWMPY 
VSIACVISY VIGHALGPSPIPALLVTEIFLQSSRPAA YMVAGTVHWLSNFTV 
GLVFPFIQVGLGAYSFVIFAVICLLTTVYIFLUPETKSKTFIEINRENLYFQ. 

Fv light chain variable region (VL); additional C-terminal residues, retained
after TEV cleavage, are shown in italics: ELDIVLTQSPLSLPVSLGDQASISC 
RSSQSIVHSNGNTYLEWYLQKPGQSPKLLIYKVSNRFSGVPDRFSGSGSGTD 
FTLKISRVEAEDLGVY YCFQGSHVPYTFGGGTKLEIKTSENLYFQ. 

Fv heavy chain variable region (VH); additional C-terminal residues retained
after TEV cleavage are shown in italics: LEVNLVESGGGLVQPGGSRKLS 
CAASGFTFSSFGMHW VRQAPEKGLEWVAHISSGSRTIDYADTVKGRFTIS 
RDNPKNTLFLQMTSLRSEDTAIYYCARGNGYYDALDYWGQGTSVTVSSA 
KTTPPSVTSENLYFQ. 

Target identification using fluorescence-based screening methods. GLUT5
homologues were cloned into the GAL1-inducible, TEV-cleavable GFP-His8
2μ vector pDDGFP2, transformed into the Saccharomyces cerevisiae strain
FGY217 (MATa, ura3-52, lys2Δ201 and pep4Δ) and overexpressed as
described previously. In brief, 10 ml S. cerevisiae FGY217 cell cultures in
−URA media and 0.1% glucose were grown at 30 °C and expression was induced
by the addition of final 2% (v/v) D-galactose at an OD600 of 0.6. After ~22 h, cells
were harvested, resuspended in 1× PBS buffer and overexpression levels assessed
by whole-cell GFP fluorescence. Fusions with detectable expression levels
were re-grown in larger 2-l culture volumes and membranes subsequently
isolated. The monodispersity of expressed fusions was screened in crude
dodecyl-β-D-maltopyranoside (DDM), decyl-β-D-maltopyranoside (DM),
nonyl-β-D-maltopyranoside (NM) and n-dodecyl-N,N-dimethylamine-N-oxide (LDAO)
solubilized membranes by fluorescence-detection size exclusion chromatography
(FSEC) as described previously. Out of the GLUT5 homologues screened,
rat and bovine GLUT5 showed the sharpest monodispersity profiles and were
the most stable after purification in detergent.

Large-scale production and purification of rat and bovine GLUT5. For rat
GLUT5, cells were harvested from 10 l S. cerevisiae cultures, resuspended in buffer
containing 50 mM Tris-HCl pH 7.6, 1 mM EDTA, 0.6 M sorbitol, and lysed by
mechanical disruption as described previously. Membranes were isolated by
ultracentrifugation at 195,000g for 3 h, homogenized in 20 mM Tris-HCl
pH 7.5, 0.3 M sucrose, 0.1 mM CaCl2, frozen in liquid nitrogen and stored at
−80 °C. Rat GLUT5 membranes were solubilized with 1% DDM in equilibration
buffer (EB) consisting of 1× PBS, 10 mM imidazole, 150 mM NaCl, 10% glycerol.
After ultracentrifugation at 195,000g for 45 min, the supernatant was incubated with
10 ml of Ni2+-nitrilotriacetate affinity resin (Ni-NTA; Qiagen) for 2 h at 4 °C. The
resin was washed with 100 ml of EB containing 0.05% DDM and 30 mM imidazole,
and the protein eluted in 25 ml of EB containing 0.05% DDM and 250 mM
imidazole. The eluted protein was incubated with His-tagged TEV protease to remove the
GFP-His8 tag and dialysed against 20 mM Tris-HCl pH 7.5, 150 mM NaCl, 10% glycerol,
0.03% DDM. Dialysed sample was then loaded onto a 5 ml HisTrap column (GE
Healthcare) equilibrated in dialysis buffer, and the flow-through was collected and
concentrated. The protein was further purified by size-exclusion chromatography
(SEC) using a Superdex 200 column (GE Healthcare) in buffer consisting of 20 mM
Tris-HCl pH 7.5, 150 mM NaCl, 0.02% DDM. All rat GLUT5 mutants were
generated with a standard PCR-based strategy and were sub-cloned, overexpressed and
purified as described for wild type.

Protein complexes were prepared by incubation of the HisTrap-purified rat 
GLUT5 with the SEC-purified Fv fragment (supplemented with 0.02% (w/v) 
DDM) at a molar ratio of 1:1.5 for 1 h on ice. The complex was subjected to
SEC (Superdex 200, GE Healthcare). The SEC step was repeated twice to ensure
reproducibility of crystals. Peak fractions containing rat GLUT5-Fv complex
were concentrated to ~10 mg ml−1 by ultrafiltration (Millipore, MWCO
50 kDa), and immediately used for crystallization experiments.

For bovine GLUT5, membranes were isolated from 40 l S. cerevisiae cultures.
After membrane solubilization and ultracentrifugation (as described for rat
GLUT5), the supernatant was supplemented with 55 mM imidazole, and incubated
with 30 ml of Ni-NTA resin for 2 h at 4 °C. The resin was washed with 600 ml of EB
supplemented with 55 mM imidazole and 0.03% DDM, and the protein was eluted
using 125 ml of EB containing 250 mM imidazole and 0.03% DDM. After cleavage of
the GFP-His8 tag by TEV-His protease, the material was re-loaded onto the same
30 ml Ni-NTA resin column (no imidazole), the flow-through was concentrated and
passed through a 1 ml Ni-NTA column for further clean up. The untagged protein
was further purified by SEC in buffer containing 10 mM Tris-HCl, pH 7.5, 150 mM
NaCl, and 0.09% undecyl-β-maltopyranoside (UDMB), and then concentrated to
~4 mg ml−1 for crystallization experiments.

Generation of single-chain Fv (scFv) fragments. Animal experiments conformed
to the guidelines outlined in the Guide for the Care and Use of
Laboratory Animals of Japan and were approved by the University of Tokyo
Animal Care Committee (approval no. RACO7101). Rat GLUT5-specific scFv
fragments were generated essentially as previously described. In brief, purified
GLUT5 was reconstituted into liposomes containing chicken egg yolk
phosphatidylcholine (egg PC; Avanti) and adjuvant lipid A (Sigma-Aldrich) by the
detergent-removal method. Small unilamellar proteoliposomes were prepared by
sonication. MRL/lpr mice were immunized with 0.1 mg of the proteoliposome antigen
three times at two-week intervals. Immunized mice were killed, and RNA in their
splenocytes was isolated and converted into cDNA via reverse-transcription PCR.
The VL and VH repertoire was assembled via an 18-amino-acid flexible linker and
cloned into the phage-display vector pComb3XSS. Biotinylated proteoliposomes
were prepared by reconstituting GLUT5 with a mixture of egg PC and
1,2-dipalmitoyl-sn-glycero-3-phosphoethanolamine-N-(cap biotinyl) (16:0 biotinyl
Cap-PE; Avanti), and used as binding targets for scFv-phage selection. The targets
were immobilized onto streptavidin-coated paramagnetic beads (Dynabeads) or
streptavidin-coated microplates (Nunc). After four rounds of biopanning,
proteoliposome-targeted enzyme-linked immunosorbent assays (‘liposome ELISAs’)
were performed on periplasmic extracts of individual colonies. Positive clones
were collected and evaluated using a Biacore T100 (GE Healthcare).

Expression and purification of Fv fragment. Antibody fragments in the scFv
format are undesirable for use as crystallization chaperones because they can
form intermolecular domain-swapped dimers, and dimer-monomer equilibrium
may increase structural heterogeneity. Therefore, we used Fv fragments for
crystallization trials. The Fv fragments were expressed in Brevibacillus choshinensis
as a secreted His-tagged protein and purified from culture medium. The cells were
grown at 30 °C with shaking at 200 rpm in 2SY medium (soypton 40 g l−1, yeast extract
5 g l−1, glucose 20 g l−1, CaCl2 0.15 g l−1) supplemented with 50 mg l−1 neomycin.
The cells expressing the VL and VH were initially grown separately as overnight
pre-cultures. The pre-cultures were then combined and diluted in 2SY medium to give
an OD600 of 0.02 for each strain and grown for a further 65-70 h. The cells were removed
by centrifugation at 6,000g for 15 min. The recovered culture supernatant was
adjusted to a final ammonium sulfate concentration of 60% saturation, and the
precipitate was pelleted, dissolved in TBS buffer (10 mM Tris-HCl, pH 7.5,
150 mM NaCl), and dialysed overnight against the same buffer. Dialysed sample
was mixed with Ni-NTA resin equilibrated with Fv1 buffer (10 mM Tris-HCl, pH
7.5, 150 mM NaCl, 20 mM imidazole). Bound proteins were eluted with Fv2 buffer
(10 mM Tris-HCl, pH 7.5, 150 mM NaCl, 250 mM imidazole), mixed with TEV-His
and dialysed overnight against TBS buffer. Cleaved His tag and TEV-His
were removed by binding to a HisTrap column equilibrated with Fv1 buffer. The
tag-free Fv fragment was concentrated and loaded onto a HiLoad 16/60 Superdex 75
column (GE Healthcare) equilibrated with TBS buffer. Peak fractions were pooled,
concentrated, flash frozen in liquid nitrogen, and stored at −80 °C.

Transport activity of reconstituted GLUT5. The proteoliposome D-fructose
uptake assay was modified from that previously described for human GLUT1
(ref. 46). In brief, purified rat GLUT5 was reconstituted by the freeze-thaw/
extrusion method. Crude lipids were extracted from bovine liver and sonicated to
make unilamellar liposomes. 500 µl of a mixture containing ~10 µg of purified
GLUT5 and 20 mg of liposomes in 10 mM TrisSO4 (pH 7.5) was flash frozen and
thawed at room temperature. Large, unilamellar proteoliposomes were prepared by
extrusion (LiposoFast, Avestin; membrane pore size, 400 nm). For each time point,
10 µl proteoliposomes (0.4 mg lipid; 0.2 µg GLUT5) was added to 10 µl transport
buffer containing 10 mM TrisSO4 and 2 mM MgSO4 (pH 7.5) with or without the
addition of 0.5 mM HgCl2. The time course of 0.1 mM [14C]-D-fructose transport was
measured at 25 °C at the indicated time intervals and stopped by the addition of cold
buffer containing 10 mM TrisSO4, 2 mM MgSO4 (pH 7.5) and 0.5 mM HgCl2, and
immediately filtered. Non-specific uptake was estimated with 0.1 mM [14C]-L-glucose.
The radioactivity corresponding to the internalized substrate was measured by
scintillation counting. Each experiment was performed in triplicate and data points
shown indicate average values of two technical replicates.
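
The per-time-point amounts quoted above follow from proportional scaling of the bulk reconstitution mixture. The short sketch below reproduces that arithmetic; the function name is hypothetical, and it assumes the mixture is homogeneous and that freeze-thaw and extrusion do not change its volume.

```python
# Bulk reconstitution mixture, with values taken from the Methods text.
BULK_VOLUME_UL = 500.0   # total volume of the protein/liposome mixture (µl)
BULK_PROTEIN_UG = 10.0   # ~10 µg of purified GLUT5
BULK_LIPID_MG = 20.0     # 20 mg of liposomes


def aliquot_contents(aliquot_ul):
    """Return (lipid in mg, protein in µg) carried by an aliquot of the bulk mixture.

    Assumption: the mixture is homogeneous, so contents scale with volume.
    """
    fraction = aliquot_ul / BULK_VOLUME_UL
    return BULK_LIPID_MG * fraction, BULK_PROTEIN_UG * fraction


lipid_mg, protein_ug = aliquot_contents(10.0)
print(f"{lipid_mg:.1f} mg lipid and {protein_ug:.1f} µg GLUT5 per 10 µl aliquot")
```

For a 10 µl aliquot this gives 0.4 mg lipid and 0.2 µg GLUT5, matching the figures stated in the text.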

Substrate specificity. Unless otherwise stated the rat GLUT5 deglycosylation
mutant (N50Y) used for structure determination is referred to as wild-type-like
protein (WT). Only rat GLUT5 wild type and mutants showing a monodisperse
peak in DDM during gel filtration were assessed for their ability to bind sugar by
tryptophan fluorescence. In all experiments, either purified rat GLUT5 wild type
or mutants were diluted to 0.06 mg ml−1 in purification buffer containing
150 mM NaCl, 20 mM Tris pH 7.5, 0.03% DDM. D-fructose, L-fructose,
D-glucose, D-mannose, D-xylose and D-galactose stocks that were freshly
prepared in purification buffer were added to the diluted protein to reach a final
concentration of 40 mM. After each sugar addition the sample was incubated at
room temperature for 2 min before measuring tryptophan fluorescence.
Measurements were performed using a Cary Eclipse Fluorescence
Spectrophotometer (Agilent Technologies), with an excitation wavelength of
295 nm and the emission spectra recorded in the range of 300-400 nm with a
5 nm excitation slit in a High Precision Cell Quartz Suprasil 10 × 2 mm cuvette
(Hellma Analytics). The emission peak of 338 nm was taken as an average of the
four wavelengths 337.03, 337.96, 339.06 and 340 nm. Each experiment was carried
out in triplicate in the absence and presence of the well-known GLUT
inhibitor HgCl2 (5 mM). To compare mutant binding to D-fructose, nonspecific
tryptophan fluorescence quenching of 2.5% (as measured with 40 mM
L-fructose) was first subtracted from both wild-type and mutant quenching levels
before calculating the final percentage of quenching relative to wild type.
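
The normalization described above can be sketched as follows. This is an illustrative reading of the text, not the authors' analysis script: the function names are hypothetical, the example intensities and quenching values are invented for demonstration, and averaging the emission intensity over the four listed wavelengths is our interpretation of how the 338 nm peak was taken.

```python
from statistics import mean

NONSPECIFIC_QUENCH_PCT = 2.5                        # measured with 40 mM L-fructose (from the text)
PEAK_WAVELENGTHS_NM = (337.03, 337.96, 339.06, 340.0)


def peak_intensity(spectrum):
    """Average the emission intensity over the four wavelengths around 338 nm.

    `spectrum` maps wavelength (nm) to intensity; this layout is an assumption.
    """
    return mean(spectrum[w] for w in PEAK_WAVELENGTHS_NM)


def percent_quenching(intensity_before, intensity_after):
    """Percentage drop in fluorescence after sugar addition."""
    return 100.0 * (intensity_before - intensity_after) / intensity_before


def relative_to_wild_type(mutant_pct, wild_type_pct):
    """Subtract nonspecific quenching from both values, then express the mutant as % of WT."""
    wt_specific = wild_type_pct - NONSPECIFIC_QUENCH_PCT
    mutant_specific = mutant_pct - NONSPECIFIC_QUENCH_PCT
    return 100.0 * mutant_specific / wt_specific


# Hypothetical numbers, for illustration only.
wt = percent_quenching(200.0, 175.0)                # 12.5 % total quenching
print(relative_to_wild_type(7.5, wt))               # mutant at 7.5 % total quenching
```

With these made-up inputs the mutant retains half of the specific wild-type signal, because (7.5 − 2.5) / (12.5 − 2.5) = 0.5.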

Binding analysis. D-fructose or D-glucose stocks that were freshly prepared in
purification buffer were sequentially added to the diluted protein to a final
concentration of 40 mM. After each sugar addition the sample was incubated
at room temperature for 2 min before measuring. Tryptophan fluorescence
measurements were performed as above. Each experiment was carried out in triplicate
in the absence and presence of the known GLUT inhibitor HgCl2 (5 mM). The rat
GLUT5 curves were fitted by nonlinear regression using the software Prism. The
bovine GLUT5 construct used for structural studies binds D-fructose with similar
affinity as rat GLUT5 with a Kd of 5.5 ± 0.6 mM.
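
The text does not state which binding model was fitted in Prism. As a minimal sketch, assuming a one-site hyperbolic model Q = Qmax·[S]/(Kd + [S]), the fit can be reproduced in plain Python: for any trial Kd the best Qmax has a closed form, so only Kd needs a one-dimensional search. All names and the synthetic data below are hypothetical.

```python
def one_site(conc, qmax, kd):
    """One-site binding: quenching as a function of sugar concentration (assumed model)."""
    return qmax * conc / (kd + conc)


def best_qmax(concs, quench, kd):
    """Closed-form least-squares Qmax for a fixed Kd (the model is linear in Qmax)."""
    x = [s / (kd + s) for s in concs]
    return sum(q * xi for q, xi in zip(quench, x)) / sum(xi * xi for xi in x)


def sse(concs, quench, kd):
    """Sum of squared residuals at the best Qmax for this Kd."""
    qmax = best_qmax(concs, quench, kd)
    return sum((q - one_site(s, qmax, kd)) ** 2 for s, q in zip(concs, quench))


def fit_kd(concs, quench, lo=0.01, hi=100.0, iterations=200):
    """Golden-section search for the Kd (mM) that minimizes the residuals."""
    ratio = (5 ** 0.5 - 1) / 2
    a, b = lo, hi
    for _ in range(iterations):
        c = b - ratio * (b - a)
        d = a + ratio * (b - a)
        if sse(concs, quench, c) < sse(concs, quench, d):
            b = d
        else:
            a = c
    kd = (a + b) / 2
    return kd, best_qmax(concs, quench, kd)


# Synthetic, noise-free titration generated with Kd = 6 mM and Qmax = 20 %.
concs_mm = [0.0, 1.0, 2.5, 5.0, 10.0, 20.0, 40.0]
quench_pct = [one_site(s, 20.0, 6.0) for s in concs_mm]
kd_fit, qmax_fit = fit_kd(concs_mm, quench_pct)
print(f"Kd ~ {kd_fit:.2f} mM, Qmax ~ {qmax_fit:.2f} %")
```

Real titration data carry noise, so a dedicated fitting package would also report a standard error on Kd, which this sketch does not attempt.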

Crystallization. The structure of the Fv fragment was used as a search model for
the molecular replacement. Crystals of the Fv fragment appeared in the well
buffer at 20 °C containing 0.1 M CAPS-NaOH (pH 10.5), 1.6 M (NH4)2SO4,
0.1 M Li2SO4, 6.5 mM n-nonyl-β-D-glucoside.

For rat GLUT5, crystals of rGLUT5-Fv complex (in DDM) used for structural
determination were grown at 20 °C by hanging-drop vapour diffusion. A 400 µl
reservoir containing 33-35% PEG 400, 0.12 M CaCl2, 0.1 M Tris-HCl (pH 8.0)
was equilibrated against a 2 µl drop containing a 1:1 mixture of the complex and
reservoir solution. After 2-3 weeks of growth, crystals were dehydrated by stepwise
equilibration of the drops against 400 µl reservoirs containing increasing
concentration of PEG 400, in steps of 5% and up to a final concentration of 70%.
Crystals were then flash-frozen and stored in liquid nitrogen.

For bovine GLUT5, crystals were grown over the course of 2-3 weeks at 4 °C in
2 µl drops (500 µl well) consisting of the well solution 31-35% PEG 300, 0.125 M
HEPES pH 8.0, 0.125 M NaCl, 0.125 M Li2SO4, mixed first with 70 mM HEGA-10
(4:1) and then the protein solution (1:1), and placed over the well solution diluted
with water (4:1) before sealing. For freezing in liquid nitrogen, crystals were
soaked for 1 min in the mother liquor supplemented with ~15% PEG 300.

Data collection, structure determination and analysis. For the Fv fragment,
diffraction data were collected at 100 K at SPring-8 beamline BL41XU (Japan)
and processed using HKL2000 packages and the CCP4 suite. The crystal
belonged to the space group C2 with two molecules per asymmetric unit. The
structure was determined by molecular replacement with Molrep using the Fv
portion of an antibody structure (PDB code 1IGC) as a search model. Refinement
was performed with REFMAC5 until the R value decreased to ~22%. For the rat
GLUT5-Fv complex, diffraction data were collected at 100 K at SPring-8 beamline
BL41XU (Japan) and processed using HKL2000 packages and the CCP4
suite. The data set used for structure determination and refinement was generated
by the combination of 4 data sets from 4 independent crystals. The space
group was determined to be P21, with two rat GLUT5-Fv complexes per asymmetric
unit. The structure was determined by molecular replacement with
Molrep using two search models (polyalanine of the transmembrane region
in the outward-facing conformation of FucP (PDB code 3O7Q), and the separately
determined 1.5 Å structure of the Fv fragment used in this study).
Refinement was performed with PHENIX followed by manual examination
and rebuilding of the refined coordinates using COOT. Recently determined
structures of XylE (PDB code 4GC0) and human GLUT1 (PDB code 4PYP)
helped with the modelling. The 6 N-terminal residues (Met1-Gln6),
22 C-terminal residues (Ser481-Gln502), and 22 residues (Asn39-Asn60) in
molecule A and 12 residues (Met45-Arg56) in molecule B in TM2 are not
included in the structure as they did not have interpretable densities. For bovine
GLUT5, data were collected on frozen crystals at beamline I02, Diamond Light
Source (UK). The data set used for structure determination and refinement was
generated using HKL2000 by processing and scaling together two data sets collected
on two different parts of the same crystal, and by correcting for anisotropy
(UCLA server, http://services.mbi.ucla.edu/anisoscale/) (Extended Data Tables 1
and 2 and Extended Data Fig. 1). The structure was solved by molecular replacement
using the N-terminal and C-terminal bundles of rat GLUT5 as independent
search models in PHASER, and refined using REFMAC5 and BUSTER against
data up to 3.0 Å resolution after anisotropy correction, with rounds of re-building
in COOT. Final refinement was performed using PHENIX at 3.2 Å resolution
(Extended Data Fig. 1). Structural alignments were performed using the
align command of PYMOL software using Cα coordinates.
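
For readers who want to check r.m.s.d. values such as those quoted in the Extended Data figure legends, the quantity itself is straightforward to compute once two structures have been superposed and their Cα atoms paired. The sketch below is not the PyMOL implementation: it assumes the coordinates are already superposed and matched one-to-one, and the function name is hypothetical.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """Root mean square deviation (Å) over paired, already superposed Cα coordinates.

    Each argument is a sequence of (x, y, z) tuples in the same residue order.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Two toy "structures" offset by exactly 1 Å along z.
model_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0)]
model_b = [(0.0, 0.0, 1.0), (3.8, 0.0, 1.0)]
print(ca_rmsd(model_a, model_b))
```

Alignment tools typically also reject poorly matching pairs before reporting the final value, which is why each r.m.s.d. in the legends is quoted together with its number of Cα pairs.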

Homology modelling. Models for the outward-occluded and inward-occluded
conformations were based on corresponding XylE crystal structures (4GBY and
4JA3, respectively) and were produced using Modeller 9v12 (ref. 54). Multiple
alignment of GLUT5 homologues (Homo sapiens, Canis lupus, Rattus norvegicus,
Felis catus, Gallus gallus, Anolis carolinensis) and XylE sequences were combined
with structural data from the XylE structures and the GLUT5 structures using
the Expresso mode of T-Coffee. The multiple alignment in combination with helical
restraints corresponding to the TMs of the bovine GLUT5 structure was used as an input
for Modeller. For each conformation, 20 models were generated with loop
optimization. The DOPE scoring function was used to select the final model.
Outward-facing bovine GLUT5 was modelled with the same protocol based on
the rat outward-facing GLUT5 structure and alignment of GLUT5 sequences. The
four conformations (outward-open, outward-occluded, inward-occluded and
inward-open) were morphed and rendered using PyMOL (http://www.pymol.org/).
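
The model-selection step described above amounts to choosing, among the 20 candidates, the model with the lowest (most negative) DOPE score. A minimal sketch follows; the file names and score values are invented for illustration, and the helper does not call Modeller itself.

```python
def select_best_model(dope_scores):
    """Return the name of the model with the lowest DOPE score.

    `dope_scores` maps a model name to its DOPE score; lower is better.
    """
    if not dope_scores:
        raise ValueError("no models to choose from")
    return min(dope_scores, key=dope_scores.get)


# Hypothetical scores for three of the candidate models.
candidates = {
    "model_01.pdb": -61250.3,
    "model_02.pdb": -61877.9,
    "model_03.pdb": -60992.1,
}
print(select_best_model(candidates))
```

Here `model_02.pdb` would be retained because its score is the most negative of the three.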

Extended Data Figure 1 | Anisotropy descriptors of bGLUT5 data reported
by the UCLA-MBI Diffraction Anisotropy Server and 2Fo − Fc electron
density maps for the bovine and rat GLUT5 structures. a, Degree of
anisotropy of bGLUT5 data, resolution limits for the 3 principal axes (left), and
panel illustrating steps along correction of bGLUT5 data for anisotropy (right).
b, Representative portions of the electron density map (1.5σ) for bGLUT5
overall model (left) and a close-up of the substrate binding site (right); residues
highlighted are numbered based on rGLUT5 for the sake of clarity. c, Electron
density (1.0σ) for rGLUT5 showing one of the inter-bundle salt-bridge clusters
that form in the open outward-facing conformation.

Extended Data Figure 2 | Superimposition of open inward-facing bGLUT5
and hGLUT1 structures, and comparison of the substrate-binding site in
bGLUT5 and inward-facing XylE. a, Ribbon representation of inward-facing
bGLUT5 (coloured as in Fig. 1a) and inward-facing hGLUT1 (light grey)
structures, as viewed in the plane of the membrane. The D-glucopyranoside
moiety of the detergent molecule bound to GLUT1 (n-nonyl-β-D-
glucopyranoside (β-NG)) is shown in stick representation. Density for ICH5 at
the C terminus is missing in both hGLUT1 and bGLUT5 inward-facing
structures and is highlighted with the dotted ellipse. The beginning of TM1
kinks further outwards in the bGLUT5 structure compared to hGLUT1 and
residues 1-18 could not be built. The r.m.s.d. (root mean square deviation) after
superposition of the two structures is 1.12 Å for 364 pairs of Cα atoms (see
Methods). b, The substrate-binding site in the inward-facing bGLUT5 structure
(coloured as in Fig. 1) is very similar to that seen in the inward-facing XylE (4JA4)
structure (shown in light grey). Only non-conserved residues and the
equivalent glutamine to Q166 are labelled for XylE.

Extended Data Figure 3 | Structure of the rat GLUT5-Fv complex.
a, Cartoon representation of the complex between rGLUT5 (grey) and
4D111Fv (heavy-chain variable region (VH) is in blue; light-chain variable
region (VL) is in red). 4D111Fv binds to the cytoplasmic domain of GLUT5,
including ICH2 (residues 226, 230, 234), the loop between ICH2 and ICH3
(residues 238, 240, 241), and ICH3 (residue 243), with ~848 Å2 of buried
surface area at the interface. b, Packing of the rat GLUT5-Fv complex
molecules in the crystal. The unit cell is represented as green lines.

Extended Data Figure 4 | Sequence alignment of rat GLUT5 (rGLUT5),
bovine GLUT5 (bGLUT5), human GLUT5 and GLUT7 (hGLUT5 and
hGLUT7), human GLUT1-4 (hGLUT1-4), Saccharomyces cerevisiae
HXT7, Plasmodium falciparum PfHT1, Arabidopsis thaliana GlcT and
Escherichia coli XylE. Structural elements of rat GLUT5 are indicated above
the alignment, and coloured as in Fig. 1a. Strictly conserved residues are
highlighted in black-filled boxes, and highly conserved residues are shaded in
grey. Green boxes highlight central cavity residues that are specific to GLUT5
and red boxes highlight those that are conserved among GLUTs. Purple boxes
highlight residues forming the salt bridges between cytosolic TM segments. A
blue box (TM5) highlights Gln166, whose mutation to glutamic acid, as present
in GLUT7, weakens D-fructose binding but supports strong D-glucose binding
in rGLUT5. The brown box (TM8) highlights Glu336 that is conserved across
all the GLUTs and replaced with glutamic acid in XylE. Red bars underneath
the alignment indicate the sugar porter (SP) family motifs. Note that
because bGLUT5 and hGLUT5 have an additional amino acid at position 8,
their numbering differs from rGLUT5 by 1 amino acid. For clarity, bGLUT5
residues are labelled using rGLUT5 numbering.

Extended Data Figure 5 | D-fructose binding monitored by tryptophan
fluorescence quenching. a, Cartoon representation of the outward-facing
rGLUT5 structure, as viewed from the plane of the membrane with the
colouring as shown in Fig. 1a. Atoms in all tryptophan residues are shown as
spheres and tryptophan W419, whose fluorescence is quenched by substrate, is
labelled. b, Emission fluorescence spectra for purified deglycosylated rGLUT5
wild-type-like mutant N50Y (referred to as WT), shown in the range of 320-
360 nm with an excitation wavelength of 295 nm after the addition of 40 mM
D-fructose (top), and 40 mM L-fructose (bottom). The emission fluorescence
spectrum for purified wild-type protein that had been previously incubated with
the inhibitor HgCl2 is also shown for D-fructose (middle). c, Tryptophan
fluorescence quenching (excitation 295 nm; emission 338 nm) after incubation
of purified rGLUT5 N50Y with either 40 mM D-fructose (filled bar) or
L-fructose, D-glucose, D-mannose, D-xylose or D-galactose as labelled (open
bars). Tryptophan fluorescence quenching for purified wild-type protein that
had been previously incubated with the inhibitor HgCl2 is also shown for
D-fructose (open bar). d, As in c, but for rGLUT5 with a single tryptophan
residue (W419), which contains the following mutations: N50Y, W70F,
W191F, W239F, W265F, W275F, W338F and W370F. No tryptophan
quenching was observed for D-fructose (5 mM HgCl2), L-fructose, D-glucose or
D-galactose. In all experiments error bars indicate s.e.m.; n = 3.

Extended Data Figure 6 | Substrate specificity in GLUT5. a, Time-
dependent uptake of D-[14C]-fructose by rGLUT5 wild type (open squares and
triangles) and the deglycosylated mutant N50Y (filled squares and triangles) in
proteoliposomes incubated with or without the inhibitor HgCl2 as labelled.
Non-specific uptake was estimated with 0.1 mM L-[14C]-glucose for wild type
(filled circles) and the N50Y mutant (open circles). In all experiments error
bars represent the spread of duplicates. Inset shows SDS-PAGE analysis of the
purified rat GLUT5 wild type and the deglycosylated N50Y mutant.
b, Tryptophan fluorescence quenching (excitation 295 nm; emission 338 nm)
after incubation of purified rat GLUT5 mutant (N50Y, W70F, W191F, W239F,
W265F, W275F, W338F, W370F) that contains a single tryptophan
residue, W419, with increasing concentrations of D-fructose (filled squares),
and for the protein previously incubated with the inhibitor mercury chloride
(open circles). c, Slab through the surface of the outward-facing rGLUT5
structure as viewed in the plane of the membrane. The substrate-bound
XylE structure was superimposed onto rGLUT5 and is shown here as
a grey ribbon. In XylE, Trp392 (Trp388 in hGLUT1) is located at the
bottom of the cavity (spheres; magenta) and coordinates D-xylose (stick form;
yellow). In GLUT5, the equivalent residue is an alanine, making the cavity
deeper. d, D-fructose binding as measured by tryptophan fluorescence
quenching (excitation 295 nm; emission 338 nm) after incubation with 40 mM
D-fructose for wild type (open bar), and TM7 mutations of Ile295 (interacts
with TM10 residues) and Tyr296 and Tyr297 residues. Equivalently located
tyrosine residues in XylE occlude the sugar-binding site from the outside.
Fluorescence quenching for the mutants is displayed as a percentage of total
wild-type binding. In all experiments error bars indicate s.e.m.; n = 3.
Panel labels: b, single Trp419 mutant, % tryptophan fluorescence quenching
versus D-fructose (mM), with and without HgCl2; D-fructose Kd = 6 ± 2 mM.
c, bottom of the cavity in XylE and in GLUT5. d, % tryptophan fluorescence
quenching relative to wild type for WT, I295V, I295A, Y296A and Y297A.

Extended Data Figure 7 | The intracellular helical domain (ICH).
a, Cytoplasmic view of the ICH domain after superposition of the open,
outward-facing rGLUT5-sFv (grey) and outward-facing occluded E. coli XylE
(teal) (4GBY) structures. b, In the outward-facing GLUT5 structure ICH1-
ICH3 are linked together by several salt bridges (side chains are labelled and
shown as sticks in yellow). In contrast, no polar interactions are formed
between ICH5 and either ICH1-ICH3 or cytoplasmic ends of N-terminal TM
bundle helices. A salt bridge forms (dotted line in magenta), however, between
Glu225 in ICH3 and Arg407 in TM11, which also forms part of the inter-
bundle salt-bridge network (side chains are labelled and shown as sticks in
cyan). c, In the inward-facing GLUT5 structure, this inter-bundle salt-bridge
network is not formed, because the cytoplasmic ends of the N- and C-terminal
bundles have moved apart; consistent with this, the ICH domain is
proposed to act as a scaffold domain that further helps to stabilize the outward-
facing conformation.

Extended Data Figure 8 | Access to the central cavity and substrate-binding
site is gated by TM7 on the outside and TM10 on the inside. a, Superposition
of outward-facing open GLUT5 and outward-facing occluded E. coli XylE
(4GBY) structures. The TM numbering for outward-facing occluded XylE has
an additional asterisk. The inward-facing GLUT5 structure is coloured as in
Fig. 1a and that of XylE in grey. The bound D-xylose is shown in stick
representation in green. The r.m.s.d. is 1.38 Å for 290 pairs of Cα atoms
(see Methods). b, Superposition of inward-open GLUT5 and inward-occluded
E. coli XylE structure (4JA3) with colouring and annotation as described in
a. The r.m.s.d. is 1.80 Å for 274 pairs of Cα atoms (see Methods). The bound
D-xylose in 4GBY is represented in stick form in green. The ICH domain is
not shown for clarity. c, Superposition of inward-facing open GLUT5 and
inward-facing open XylE (4JA4) structures as viewed from the cytoplasmic side
with colouring and annotation as described in a. The ICH domain is not shown
for clarity. The r.m.s.d. is 1.70 Å for 273 pairs of Cα atoms (see Methods).


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Table 1 | Crystallographic data collection and refinement statistics

                               rat GLUT5-Fv             bovine GLUT5
Data collection
Space group                    P21                      P2422)
Cell dimensions
  a, b, c (Å)                  76.78, 151.54, 106.40    74.61, 112.15, 139.57
  α, β, γ (°)                  90.00, 97.25, 90.00      90.00, 90.00, 90.00
Resolution (Å)                 50-3.27 (3.39-3.27)      100-3.00 (3.11-3.01)*
Rsym or Rmerge                 21.5 (>100)              10.06 (>100)
I/σI                           11.6 (1.5)               11.21 (0.86)
Completeness (%)               100 (100.0)              99.4 (93.9)
Redundancy                     13.0 (13.0)              10.2 (8.6)
Refinement
Resolution (Å)                 50-3.27 (3.5-3.27)       33.3-3.2 (3.31-3.20)
No. reflections (Rfree set)    38017 (3355)             13346 (1331)
Rwork/Rfree                    24.2/28.8 (35.4/37.5)    23.6/25.8 (32.8/36.5)
No. atoms
  Protein                      10657                    3382
B-factors
  Protein                      157.0                    149.2
R.m.s. deviations
  Bond lengths (Å)
  Bond angles (°)
Ramachandran plot (%)
  Favored
  Outliers                     0                        0

Data were obtained by scaling together two datasets collected on the same crystal.
*Highest resolution shell used in the final refinement is shown in parentheses.

Extended Data Table 2 | Completeness of bovine GLUT5 data per resolution shell after correction for anisotropy


Resolution range (A) Completeness (%) 
100.0 - 8.10 95.5 
8.10 - 6.43 100.0 
6.43 - 5.62 99.9 
5.62 - 5.10 99.9 
5.10 - 4.74 100.0 
4.74 - 4.46 99.7 
4.46 - 4.24 99.6 
4.24 - 4.05 100.0 
4.05 - 3.90 78.4 
3.90 - 3.76 53.4 
3.76 - 3.64 40.6 
3.64 - 3.54 33.4 
3.54 - 3.45 26.3 
3.45 - 3.36 21.2 
3.36 - 3.29 15.4 
3.29 - 3.22 13.1 
3.22 - 3.15 9.4 
3.15 - 3.09 6.0 
3.09 - 3.04 3.5 
3.04 - 3.00 1.7 

Two independent and primitive envelopes of the
bilobate nucleus of comet 67P

The factors shaping cometary nuclei are still largely unknown, but
could be the result of concurrent effects of evolutionary and
primordial processes. The peculiar bilobed shape of comet
67P/Churyumov-Gerasimenko may be the result of the fusion of
two objects that were once separate or the result of a localized excavation
by outgassing at the interface between the two lobes. Here we
report that the comet’s major lobe is enveloped by a nearly continuous
set of strata, up to 650 metres thick, which are independent of an
analogous stratified envelope on the minor lobe. Gravity vectors
computed for the two lobes separately are closer to perpendicular
to the strata than those calculated for the entire nucleus and adjacent
to the neck separating the two lobes. Therefore comet 67P/
Churyumov-Gerasimenko is an accreted body of two distinct objects
with ‘onion-like’ stratification, which formed before they merged.
We conclude that gentle, low-velocity collisions occurred between
two fully formed kilometre-sized cometesimals in the early stages
of the Solar System. The notable structural similarities between the
two lobes of comet 67P/Churyumov-Gerasimenko indicate that the
early-forming cometesimals experienced similar primordial stratified
accretion, even though they formed independently.

Soon after the insertion of the robotic space probe Rosetta into orbit
around comet 67P/Churyumov-Gerasimenko on 6 August 2014,
images acquired by the Optical, Spectroscopic, and Infrared Remote
Imaging System (OSIRIS) provided evidence of morphological
terraces distributed all over its surface. In the following months,
imaging data from both the Narrow Angle Camera (NAC) and Wide
Angle Camera (WAC) confirmed that these features, together with
cuesta-like morphologies, are related to a pervasive stratification at the
surface of the nucleus (Figs 1, 2 and Extended Data Figs 1-4). (A cuesta is
an asymmetric ridge characterized by a long and gentle backslope that
conforms with the dip of a resistant stratum called cap-rock, and a steep
frontal slope.) Terraces and localized strata are not new features of
cometary nuclei, having been observed at the surface of comet 9P/
Tempel 1 (refs 8 and 9) and, possibly, of comets 81P/Wild 2 (ref. 10)
and 19P/Borrelly (ref. 11). Their origin has been explained either as
the result of sublimation, erosion and redeposition of lag materials
at the surface, as successive flows of fluidized material, or as the surface
expression of an inner stratified structure related to primordial
accretion. However, the OSIRIS images of comet 67P/Churyumov-
Gerasimenko provide evidence of global stratification with an imaging
resolution of up to 0.1 m per pixel, greatly exceeding that of previous
close-range images, the best of which are around 7 m per pixel (Figs 1,
2 and Methods section ‘Data and surface strata description’).

The structure delineated by the strata and the relationship between 
gravity vectors and strata orientations provide information that we can 
use to disentangle the main formation hypotheses concerning the 
unusual bilobed shape of the nucleus of comet 67P/Churyumov- 
Gerasimenko. Is its shape the manifestation of a concentrated localized 
erosion of a single monolithic body into a two-lobe configuration or 
the expression of two distinct accreted objects? What are the implica- 
tions in terms of comet origin? 

We were able to retrieve the strata orientations of both the major 
lobe (the main body) and the minor lobe (the head) of the comet by 
using the best-fitting planes of terraces and cuesta-like features 
obtained from a stereo-photoclinometry shape model covering 75% 
of the comet nucleus (Methods section ‘Best-fitting planes’). The 103
best-fitting planes throughout the entire comet nucleus that we derived 
emphasize that the strata are coherently oriented all around the comet, 
suggesting two separate wrapping sequences for the head and the main 
body (Fig. 3 and Extended Data Figs 5 and 6). 
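
The comparison between strata orientation and gravity that underpins this analysis reduces, for each best-fitting plane, to the angle between the plane's normal vector and the local gravity vector. A minimal sketch of that calculation is given below; it is not the authors' code, the function name is hypothetical, and taking the absolute value of the dot product (so that the arbitrary sign of the normal does not matter) is our assumption.

```python
import math


def angular_deviation_deg(plane_normal, gravity):
    """Angle in degrees between a plane normal and a gravity vector.

    0 means gravity is perpendicular to the stratum; 90 means it lies in the plane.
    """
    dot = sum(n * g for n, g in zip(plane_normal, gravity))
    norm_n = math.sqrt(sum(n * n for n in plane_normal))
    norm_g = math.sqrt(sum(g * g for g in gravity))
    if norm_n == 0.0 or norm_g == 0.0:
        raise ValueError("vectors must be non-zero")
    # Clamp to guard against rounding slightly above 1.
    cos_angle = min(1.0, abs(dot) / (norm_n * norm_g))
    return math.degrees(math.acos(cos_angle))


# A horizontal stratum (normal along z) under three hypothetical gravity directions.
print(angular_deviation_deg((0.0, 0.0, 1.0), (0.0, 0.0, -1.0)))
print(angular_deviation_deg((0.0, 0.0, 1.0), (0.0, 1.0, -1.0)))
print(angular_deviation_deg((0.0, 0.0, 1.0), (1.0, 0.0, 0.0)))
```

Evaluating this angle once with gravity computed for the whole nucleus and once with gravity computed for each lobe separately gives the two sets of deviations that the letter compares.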

The strata are ubiquitous at the surface, vertically outspread along
scarps, parallel to each other, and laterally continuous (Methods
section ‘Surface strata description’), so we were able to obtain a series of
geological sections by cutting the comet nucleus along two different
sets of planes: one along the major axis and one perpendicular to it
(Methods subsection ‘Geological sections’). The geological sections
transecting both the main body and the head show that the strata of
the two lobes are clearly independent of each other, and define two
envelopes not only on the surface but underneath, locally reaching a
depth of 650 m (Fig. 4a and Extended Data Figs 7 and 8). The separation
is particularly evident in the opposite inclination of strata at the
neck flanks (Fig. 4a and Extended Data Figs 7 and 8). Similarly, strata
clearly define a mantling envelope on the sections perpendicular to the
major axis of the body (Fig. 4b, c and Extended Data Fig. 9).

Figure 1 | The main body. a, Overview of the Seth
region, showing lineated cliffs and terraces.
Other regions and the location of the candidate
landing site A (one of the five locations identified
in August 2014 as candidates to set down the Philae
lander) are also shown. The dashed square is the
location of Fig. 2d. b, Lateral view of site A with
Seth terraces in the background (white arrows).
Notice the strata within the pit at the margin of
site A. c, Seth-Anubis contact. Dashed white lines
are strata. The long white arrow is a terrace in the
Atum region. Small white arrows indicate the
terrace margin in Anubis. d, WAC view of
the Imhotep region. The plateau structure B and
the stratification of bounding regions (white
arrows) are evident. e, NAC view of Imhotep
staircase terraces (white arrows) extending
underneath the Imhotep southern boundary strata.
The eye symbol indicates the view of Extended
Data Fig. 2d.

Figure 2 | The head of comet 67P/Churyumov-
Gerasimenko. a, The Hathor region, showing
strata (dashed red lines) and fractures (dashed
blue lines). In the foreground the cuesta-like
morphologies of the Ash and Babi regions are
visible. No abrupt change of strata orientation
occurs between Hathor and Ma’at. b, Stratification
at the Hathor-Anuket boundary. Terraces
(white arrows) show that strata are continuously
changing their orientation. c, WAC overview of
Anuket. The dashed white line underlines the scarp
where Anuket extends underneath Ma’at. The
dashed box is the location of b. d, NAC view of the
Serquet and Nut regions: Serquet terraces (white
arrows) are parallel to the Nut planar region.

Figure 3 | 3D views of the nucleus and strata of comet 67P/Churyumov-
Gerasimenko. a, Entire nucleus; b, separated lobes. Best-fitting planes derived
from strata are shown for the complete shape (left) and alone (right). The

The thicknesses of the stratified sequence (several hundred metres 
beneath most regions) we derived should be considered lower bounds 
because the stratification might continue to greater depths. The thick- 
ness of the stratified rocky-like material excludes by itself any causative 
process related to weathering during the evolution of comet 67P/ 
Churyumov-Gerasimenko. Layering from metamorphic waves 
should also be ruled out, since metamorphic fronts generate gradual 
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colour scales indicate the angular deviation between the plane vector and the 
local gravity vector calculated for the whole body (a) and the separated lobes 
(b). The red point in b is the contact point between the two lobes. 


transitions and not the distinct strata with sharp boundaries that are 
seen on comet 67P/Churyumov-Gerasimenko. 

We can constrain the structural reconstruction further by compar- 
ing the orientation of the planes fitting terraces and cuesta-like features 
with the local gravity-field vectors computed for either the whole 
comet nucleus or for the two lobes as if they are independent 
(Methods section ‘Gravity-field vectors’). In Figs 3 and 5 the angular 
relationship between the planes of the strata and the gravity-field 
vectors is visualized through three-dimensional (3D) reconstructions 
and diagrams. The histograms of Fig. 5 show the distribution of the 
absolute angular deviations between the local gravity vector and the 
planes normal to the strata at distances lower and higher than 1.7 km 
from the contact point of the two separated (and reconstructed) lobes 
(red point in Fig. 3b). The angular values and the standard deviations of 
the strata close to the neck (distances ≤1.7 km) are higher if the gravity 
field of the entire comet nucleus is considered than if the two lobes are 
treated as separated objects. In contrast, no major differences are recorded 
between the two gravity models far from the neck region (distances 
>1.7 km), where both angular values and standard deviations are low. 
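
The comparison described in this paragraph can be sketched numerically. The following is a minimal illustration and not the authors' code: it assumes that each best-fitting plane is described by a centre and a normal vector, that a gravity vector has already been evaluated at each centre, and that the sign of a plane normal is arbitrary, so that angles are folded into the range 0° to 90°. All function and argument names are hypothetical.

```python
import numpy as np

def angular_deviation_deg(normals, gravity):
    """Absolute angle (degrees) between each plane normal and the local gravity vector.

    Assumption: the orientation (sign) of a normal carries no meaning,
    so the result is folded into [0, 90] degrees.
    """
    n = np.asarray(normals, dtype=float)
    g = np.asarray(gravity, dtype=float)
    n = n / np.linalg.norm(n, axis=1, keepdims=True)
    g = g / np.linalg.norm(g, axis=1, keepdims=True)
    cosang = np.abs(np.sum(n * g, axis=1))
    return np.degrees(np.arccos(np.clip(cosang, 0.0, 1.0)))

def summarize_by_distance(centres, normals, gravity, contact_point, threshold_km=1.7):
    """Mean and standard deviation of the deviations near and far from the contact point."""
    dev = angular_deviation_deg(normals, gravity)
    dist = np.linalg.norm(np.asarray(centres, dtype=float) - np.asarray(contact_point, dtype=float), axis=1)
    near = dist <= threshold_km
    stats = {}
    for label, mask in (("near", near), ("far", ~near)):
        if mask.any():
            stats[label] = (float(dev[mask].mean()), float(dev[mask].std()))
    return stats
```

Running `summarize_by_distance` twice, once with gravity vectors from a whole-nucleus model and once with vectors from a separated-lobe model, would give the two sets of means and standard deviations that the text compares.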
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Figure 4 | Geological sections through comet 67P/Churyumov-Gerasimenko. 
a, Longitudinal section along the comet’s major axis (y = 0). b, c, Transversal 
sections perpendicular to the comet’s major axis (x = 0, y = 0). In the top 
row are perspective views of the comet nucleus showing the different 
cross-sectional slices. Shown in red are the best-fitting planes projected into the 
geological section; shown in blue are all the other best-fitting planes. In the 
bottom row are geological sections. Shown in blue are the best-fitting planes; 
green arrows are vectors normal to each plane; yellow shading is the field of 
lines used for drawing strata; dashed black lines indicate strata; dashed grey 
lines indicate inferred strata. Further geological sections are in Extended Data 
Figs 7–9. The traces of transversal sections are reported in the longitudinal 
one and vice versa. [Axis label: Distance from the major axis (km).] 

404 | NATURE | VOL 526 | 15 OCTOBER 2015 

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Body (>1.7 km) 11.56 9.314 15.17 8.303 
Head (>1.7 km) 8.853 6.855 10.05 8.083 


Figure 5 | Strata and local gravity vectors. Distribution of angular deviations 
between the local gravity vector and the normal-to-strata planes at distances 
lower and higher than 1.7 km from the point of intersection between the 
two reconstructed lobes (red point in Fig. 3b). The diagrams show histograms 
of the angular deviations as a percentage of the total best-fitting planes at 
distances lower and higher than 1.7 km for the body and the head respectively. 
Gaussian distributions normalized to the bin dimensions are overplotted on 
the histograms to emphasize the different means and standard deviations σ 
(also reported in the table). The angular values are referred to the local gravity 
vector calculated for the complete comet nucleus (dark grey bars and black 
curve) as well as for the two separated objects (pale grey bars and grey curve). 


Hence the detected stratification seems to be best modelled as being 
perpendicular to the gravity vectors in each of the lobes. This would 
suggest that the stratified envelopes of the head and the main body 
formed independently and before the aggregation of the two lobes. 

OSIRIS data have revealed that stratification is a dominant structural 
aspect of comet 67P/Churyumov-Gerasimenko throughout its entire 
bilobed shape, with the only exception being the neck region separating 
the main body from the head. The structural reconstruction of the 
comet nucleus based on the orientation and distribution of the 
identified strata shows that the two lobes were two independent objects 
(perhaps formed as pebble-pile planetesimals consisting of aggregates 
of primordial pebbles from the Solar nebula) characterized by onion-like 
stratification several hundred metres thick. The cuesta-like morphologies, 
derived from differential sublimation of the stratified 
sequences (Methods section ‘Surface strata description’), imply varia- 
tions in the relative abundances of volatile materials among strata. The 
relationship between strata plane orientations and gravity-field vectors, 
calculated for the entire comet and the two separated (and recon- 
structed) lobes, suggests that these envelopes formed independently 
before the fusion of the two objects (the head and the main body) into 
a single consolidated nucleus. The ordered sequence of wrapping strata 
precludes a chaotic structure up to a depth of several hundred metres, 
suggests that stratification is a primary structure, and implies a merging 
of two fully formed kilometre-sized cometesimals in the early stages of 
the Solar System via a low-velocity impact. The two lobes are strikingly 
similar in terms of deep onion-like structure, surface composition and 
observed surface features. Taken together, the structural and compositional 
similarities indicate that the two cometesimals experienced similar 
accretion processes, although they formed completely independently. 
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Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Data. For the observational work and geological mapping we used both OSIRIS 
NAC and WAC images. The NAC images used for this work were acquired soon 
after the orbit insertion on 6 August 2014 up to 17 March 2015 and have a spatial 
scale ranging from about 2 m per pixel to 0.1 m per pixel depending on the 
cometocentric distance of acquisition (0.1 m per pixel for the 14 February 2015 
close fly-by); WAC data were considered only for distances less than 45 km and 
have a spatial scale ranging from about 4.5 m per pixel to 2.5 m per pixel (Extended 
Data Table 1). Lower-resolution NAC images and all WAC images provided 
synoptic views of regional sectors of the whole nucleus and were useful for the 
broad mapping of strata and terraces. High-resolution NAC data were instead 
used to unravel details of the stratigraphic sequences and their presence within pits 
and along steep walls in several locations at the nucleus surface. 
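
As a rough consistency check on the spatial scales quoted above, the pixel scale can be estimated as the distance multiplied by the angular size of one pixel (small-angle approximation). The per-pixel angular resolutions used below, about 18.6 µrad for the NAC and about 101 µrad for the WAC, are assumed instrument values that are not stated in this text, and the function name is hypothetical.

```python
# Assumed angular size of one pixel, in radians (not given in the text).
NAC_IFOV_RAD = 18.6e-6
WAC_IFOV_RAD = 101e-6

def pixel_scale_m(distance_km, ifov_rad):
    """Approximate spatial scale (metres per pixel) at a given distance.

    Uses the small-angle approximation: scale = distance * angular pixel size.
    """
    return distance_km * 1000.0 * ifov_rad

# WAC image acquired from the 45 km limit mentioned in the text.
wac_scale_at_45_km = pixel_scale_m(45.0, WAC_IFOV_RAD)
```

Under these assumed values, a WAC image taken from 45 km has a scale of roughly 4.5 m per pixel, in line with the coarsest WAC scale quoted above.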

Surface strata description. (See also Supplementary Information section 
‘Concept of strata applied to the 67P/C-G comet nucleus’.) The deep depression 
(the ‘neck’) separating the larger lobe (the ‘main body’) from the smaller one (the 
‘head’) allows for detailed views of both sides, providing insight into the inner 
structure of the two lobes. For this reason we describe the main stratified structure 
of the comet starting from the neck and going towards the main body and the head. 
Our observations refer to the regional physiographical distinctions defined by the 
OSIRIS team. These morphological regions are probably the results of uneven 
fracturing, weathering and sublimation processes on the comet. The differential 
action of these processes on the primary structure most probably shaped the 
regions now visible on the comet nucleus. 

The main body has a size of 4.1 km × 3.3 km × 1.8 km along three perpendicular 
directions and shows several morphologies indicating inner stratification. 
The most remarkable ones are the terraces and cliffs of the Seth region facing the 
neck region (Hapi) (Fig. 1a). 

The Seth region includes terraces at different scales; these are always aligned 
with other terraces and lineaments, indicating a continuous inner stratification 
(Fig. 1a and Extended Data Fig. 1a–c). An impressive example is given by the 
terrace at the candidate landing site labelled A (a plain 400 m long and 600 m wide 
delimited by two steep walls) (Extended Data Fig. 1d). The upper wall (400 m high) 
shows smaller-scale aligned terraces joined by linear discontinuities that are stra- 
tigraphically traceable for hundreds of metres. These small terraces have orienta- 
tions similar to the main terrace and coherent with the lineaments of the lower wall 
(300 m high). On one side of the main terrace there is a pit 220 m wide and 185 m 
deep, characterized by strata as thin as 3 m that evidently represent the inner 
skeleton of the comet nucleus down to a depth of at least some hundreds of metres 
(Fig. 1b). Again, the orientation of this stratified sequence is compatible with 
the one visible in the main terrace and the walls. Hence, the average thickness 
enclosed between terraces is not representative of the thickness of the elemental 
strata, which are much thinner, a few tens of centimetres to a few metres (compatible 
with the particle sizes of the accreting materials). 

The Seth terraces shown by the OSIRIS images appear to gradually change their 
orientation, that is, they become more inclined and ‘dip’ towards the neck region 
(Hapi), and are less inclined towards the Ash region. This general trend is illu- 
strated by the lateral boundary separating Seth and Ash from Anubis and Atum 
(Fig. 1c). This boundary consists of a continuous scarp whose maximum elevation 
apparently constrains the stratigraphic thickness of the Seth region to 150 m. 
However, the presence of terraces in Anubis underneath the smooth deposits 
(Fig. 1c) suggests that the stratification probably reaches greater depths from 
the surface. 

The Ash region, which connects Seth (facing the head) to Imhotep (on the 
opposite side), is characterized by several cuesta-like and mesa-like features 
(Fig. 1d and Extended Data Fig. 2a–c). On Earth, cuestas are formed by differential 
erosion of a stratified sequence of differently erodible strata inclined less than 30° 
with respect to the surrounding average topography. The orientation of the strata 
is coincident with the dip and the dip direction (that is, the direction of maximum 
inclination) of the gentle slope. Mesas originate via the same process but are char- 
acterized by horizontal terraces. Similarly, the cuesta-like and mesa-like morphol- 
ogies on the comet nucleus could have been generated by a differential erosional 
sublimation of a stratified sequence of strata with different volatile contents. 

Across the Ash region, the strata gradually change their dip direction, assuming 
an inclination opposite to that of the Seth terraces and slightly dipping towards the 
Imhotep depression. The low dip angles of Ash stratification favour the formation 
of sequences of aligned flat terraces that can be followed for hundreds of metres to 
kilometres in mesa-like morphologies (Fig. 1d and Extended Data Fig. 2a-c). 

On the side opposite Anubis, Seth and Ash grade into the Babi region. The 
gradual passage between these provinces means that no major change or break 
occurs in the inner structure, suggesting it is constituted by a continuous inner 
stratigraphy. Indeed, Babi is characterized by terraces and strata at all scales 
(Extended Data Fig. 3). 


The Imhotep depression is entirely surrounded by scarps with evidence of 
stratification (Fig. 1d and Extended Data Fig. 2a and b), suggesting a stratified 
structure for all the confining provinces. Hence, Khepry and Apis also show 
evidence of a pervasive inner stratification. The elevation of the Imhotep bounding 
scarps is around 200 m, and this would roughly define the minimum thickness of 
the stratified structure in the surrounding regions. Within Imhotep itself there are 
several orders of staircase terraces underneath smooth materials physically connected 
with the strata at Imhotep boundaries (Fig. 1e and Extended Data Fig. 2a, b 
and d). Strata within Imhotep must be considered as a fundamental structure. 

Surface-emplaced geological units on comets could be sequences of air-fall 
deposits of lag materials and successive flows of fluidized materials. Strata of 
both air-fall deposits and fluidized flows should display smooth and gradual 
terminations, and not sharp boundaries as do the steps (terrace margins) shown 
at metre-to-centimetre scale in Fig. 1e and Extended Data Fig. 2d and e. On the 
other hand, later erosive processes might form angular boundaries on previous 
smooth terminations. However, flow features have not been detected at any scale 
on OSIRIS images of the Imhotep area and this observation definitively rules out 
the Belton and Melosh mechanism of emplacement in this region. Moreover, the 
terrace margins within the Imhotep depression are connected by flat planes cover- 
ing several square kilometres; any air-fall deposit would instead mantle the very 
irregular pre-existing topography, making the formation of such continuous flat 
areas very unlikely. All of this evidence proves that air-fall deposits, represented by 
the loose material on top of the stratified sequence, cannot explain the staircase 
succession of terraces in Imhotep. These terraces clearly connect the terrace margins 
with the strata of the bedrock walls bounding the Imhotep region, and even 
the lateral continuations of the terraces in the smooth terrain are connected to the 
bedrock stratification (Fig. 1e). In Extended Data Fig. 2d the strata forming the 
largest roundish plateau at the Imhotep boundary must be exhumed, since they 
dip underneath the loose mantling material. All these observations mean that the 
stratified inner structure is thicker than is expected from looking at the average 
elevation of the scarp bounding the Imhotep depression. 

To summarize, all regions of the main body are characterized by strata and 
terraces with the exception of Aten, which is an elongated depression covered by 
debris. Several different views (Fig. 1c, d and Extended Data Fig. 2a and b) suggest 
that the comet is at least partially wrapped in strata. 

The most spectacular section across the minor lobe (2.6 km × 2.3 km × 1.8 km 
in size) is Hathor, a wall 900 m high and 1,500 m wide displaying about one third 
of the head’s inner structure and facing the Hapi region. The Hathor cliff is made up of 
a highly pervasive stratification demonstrated by lineaments with a regular spacing 
and intimately associated with small terraces (Fig. 2a). Strata are crosscut by a 
perpendicular set of lineaments that can be interpreted as fracture systems since 
they are not regularly spaced, are not associated with any terrace, and are normally 
continuous and not interrupted by strata. 

At the sharp topographic boundary between Hathor and Ma’at no abrupt 
change of strata orientations is recorded (Fig. 2a). In contrast, the same stratifica- 
tion seems to be involved in a change in orientation on both sides of the main wall 
of Hathor, where they are steeply inclined (Fig. 2a). However, the small terraces 
within Hathor are slightly dipping towards Hapi as well and, in different views, the 
lateral variation in strata attitudes between Hathor and the confining Anuket 
region appears to be continuous. The alcove carved at the contact between 
Hathor and Anuket marks the change in the strata orientation, providing a 
good lateral section of what comprises the Anuket inner structure (Fig. 2b). As 
usual for any topographic surface parallel to strata, Anuket (as well as Apis and 
Khepry in the main body; Extended Data Fig. 2b) displays only some, although 
meaningful, examples of terraces (Fig. 2b and c). Thomas et al. have pointed out 
how Anuket materials seem to extend underneath the Ma’at region (Fig. 2c). This 
explains the continuous change in orientation of strata between Anuket and 
Hathor/Ma’at, and inherently implies that the stratification visible on the Hathor 
900-m-high wall is actually composed, in its upper portion, of Anuket and then 
Ma’at materials. At the same time, this seems to reinforce the observation that 
Hathor is representative of a wide part of the head’s inner structure that has been 
exposed because of the dismantling (by erosion or detachment) of a wide portion of 
the head whose remnant is the distinctive Anuket region that we see today. 

Like Ash in the main body, Ma’at, Maftet and Bastet are also characterized by 
cuesta- and mesa-like morphologies, often affected by gravitational niches and 
distributed in staircase patterns (Extended Data Fig. 4a). The orientation of the 
related terraces varies very gradually across these regions and no abrupt changes 
are recorded. The Hatmehit depression is carved from these regions and its 
bounding scarps give good views into the subsurface structures of the confining 
regions. In particular, in OSIRIS context images centripetal patterns are apparent, 
mainly at the boundary between Hatmehit and Ma’at (Extended Data Fig. 4a). 
However, at a closer view these features appear to be the result of composite 
morphologies resulting from the interplay of inclined terraces caused by dipping 
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strata and fractures (Extended Data Fig. 4b and c). This further indicates that 
Ma’at, Maftet and Bastet contain a stratified sequence at least as thick as the depth 
of the Hatmehit depression (around 150 m). However, the prominent step 
revealed within Hatmehit, which does not extend outside the depression 
(Extended Data Fig. 4a), appears to be the expression of a buried terrace rather 
than of a later deformation (Extended Data Fig. 4d and e). This would imply a 
stratified sequence thicker than the maximum elevation of the scarps over the 
depression. 

Finally, the Nut and Serqet regions can be considered from a structural point of 
view as a single system. In fact, the Serqet scarp facing Nut displays strata 
emphasized by small terraces whose orientations are parallel to the overall trend 
of the Nut flat surface representative of a large strata plane itself (Fig. 2d). 

In summary, all regions of the minor lobe except Hatmehit, which shows a 
buried terrace, appear to be characterized by stratified sequences. Moreover, the 
structural relationship between Hathor, Ma’at and Anuket as well as the orienta- 
tion of terraces bounding Hatmehit and the cuesta-like morphologies of the other 
regions suggest that a stratified envelope characterizes the head as well. 
Best-fitting planes. All the procedures of plane extraction and 3D geological 
reconstructions of strata attitudes were implemented using MeshLab 
(http://meshlab.sourceforge.net/) and Matlab software on a shape model obtained 
through stereo-photoclinometry and covering 75% of the entire surface. The shape 
model is characterized by 3.6 million facets at 6 m sampling. The best-fitting planes 
were extracted on the basis of the shape nodes of the terraces. The number of 
points used for best-fitting is 420 on average. Extended Data Figure 5 shows the 
distribution of the ratios between the standard deviations of the 3D coordinates in 
the local reference system (where the z axis is normal to the surface). The mean 
ratio is 0.0574, which corresponds to a maximum error on the dip angle and the 
dip direction of the best-fitting planes lower than 2°. 
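
The plane extraction described above was carried out by the authors in MeshLab and Matlab; the exact algorithm is not given. As an illustrative sketch only, a least-squares plane can be fitted to the shape nodes of a terrace with a singular value decomposition, which also yields the standard deviations along the two in-plane axes and along the normal. The specific ratio used here (standard deviation along the normal divided by the smaller in-plane standard deviation) and the convention that the z axis of the input frame is the local vertical and the y axis is the azimuth reference are assumptions, as are all names.

```python
import numpy as np

def fit_plane(points):
    """Least-squares plane through 3D points via SVD of the centred coordinates.

    Returns centroid, unit normal, a flatness ratio (assumed definition),
    dip angle and dip direction in degrees.
    """
    pts = np.asarray(points, dtype=float)
    centroid = pts.mean(axis=0)
    centred = pts - centroid
    # Rows of vt are the principal directions, ordered by decreasing spread;
    # the last one is the direction of least spread, i.e. the plane normal.
    _, _, vt = np.linalg.svd(centred, full_matrices=False)
    normal = vt[2]
    if normal[2] < 0:          # orient the normal towards +z (assumed "up")
        normal = -normal
    # Standard deviations in the plane's own frame (x, y in-plane; z normal).
    sx, sy, sz = (centred @ vt.T).std(axis=0)
    ratio = sz / min(sx, sy)   # hypothetical flatness ratio
    dip = np.degrees(np.arccos(np.clip(normal[2], -1.0, 1.0)))
    # Direction of steepest descent: azimuth of the horizontal part of the
    # upward normal, measured clockwise from the +y axis.
    dip_direction = np.degrees(np.arctan2(normal[0], normal[1])) % 360.0
    return centroid, normal, ratio, dip, dip_direction
```

A small ratio indicates that the selected nodes are well described by a plane; the text reports a mean ratio of 0.0574 for the authors' own definition, which may differ from the one assumed here.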
Geological sections. The stereo-photoclinometric shape model has been the 
source for retrieving cross-sectional topographic profiles parallel (longitudinal) 
and perpendicular (transversal) to the major axis of the comet nucleus. On the 
derived topographic profiles we projected all the terrace-related planes (that is, 
best-fitting planes) whose centres were at a distance not exceeding 500 m from the 
plane of the section. (A similar projection of surface measurements into specific 
topographic profiles was used for the geological sections of the 67-km-long 
Brenner Basis Tunnel, which will cross through the European Alps; 
http://www.bbt-se.com/en/home/.) The projected best-fitting planes are red in the top 
panels of Fig. 4 and the left panels of Extended Data Figs 7-9 and blue/green in the 
related cross-sectional slices. 
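
The selection and projection step can be illustrated as follows. This is a sketch under stated assumptions, not the procedure the authors implemented: it assumes that plane centres are projected orthogonally onto the section plane and that the trace of each stratum in the section is the intersection line between the best-fitting plane and the section plane. Coordinates are in kilometres and all names are hypothetical.

```python
import numpy as np

def planes_near_section(centres, normals, section_point, section_normal, max_dist_km=0.5):
    """Select best-fitting planes close to a section and project them onto it.

    Returns a boolean mask of selected planes, their centres projected
    orthogonally onto the section plane, and the unit direction of each
    plane's trace within the section.
    """
    c = np.asarray(centres, dtype=float)
    n = np.asarray(normals, dtype=float)
    s = np.asarray(section_normal, dtype=float)
    s = s / np.linalg.norm(s)
    # Signed distance of every plane centre from the section plane.
    offsets = (c - np.asarray(section_point, dtype=float)) @ s
    keep = np.abs(offsets) <= max_dist_km
    projected = c[keep] - np.outer(offsets[keep], s)
    # Intersection line of each stratum plane with the section plane.
    traces = np.cross(n[keep], s)
    lengths = np.linalg.norm(traces, axis=1, keepdims=True)
    # A stratum parallel to the section has no trace; it is left as zeros.
    traces = np.divide(traces, lengths, out=np.zeros_like(traces), where=lengths > 0)
    return keep, projected, traces
```

With `max_dist_km=0.5` the selection reproduces the 500 m criterion stated above.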

Following consolidated stratigraphic procedures, we have considered the 
strata to be parallel to one another, laterally continuous, and with a constant 
thickness throughout the section. This means that the attitude of inner strata is 
constrained by the orientation of the strata above at the surface. Hence vectors 
perpendicular to the strata planes at the surface interpolated from one terrace to 
another define the field lines (yellow in the cross-sectional slices), which constrain 
the strata orientations underneath the surface (black dashed lines). By drawing 
the geological sections starting from the best-fitting planes transecting the 
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topographic profile and following the constraints given by the derived field lines 
(that is, strata must be perpendicular to the yellow field lines), we were able to 
partially reconstruct the geological structure of the interior of the nucleus up to 
depths of several hundred metres (Fig. 4 and Extended Data Figs 7-9). In sum, 
all the strata drawn in the sections originate from one or more measurements 
taken at the surface, are parallel to each other and laterally continuous (as they are 
in the OSIRIS images), maintain the same thickness throughout the section and 
have their attitude constrained by the orientation of the strata above. The strata 
identified in the longitudinal and transversal sections are not necessarily the same 
since they cut through different topographies. 

Figure 4 and Extended Data Fig. 9 show all the strata intersected by the displayed 
sections, whereas Extended Data Figs 7 and 8 show simplified longitudinal 
sections, to clarify the procedure we used. Indeed, on one hand, within the lon- 
gitudinal sections of Extended Data Figs 7 and 8, we have reported only the 
extension of the strata measured at the surface of the same section (blue lines 
transecting the topographic profiles). On the other hand, in each transversal 
section of Extended Data Fig. 9 we have considered all the strata, including the 
ones derived only from the longitudinal sections and intersecting the transversal 
traces at depth (Extended Data Fig. 9). In summary, almost the whole interpreta- 
tion has been directly constrained by the orientation of the best-fitting planes and 
related field lines. Geo-structural interpretation indirectly controlled by the atti- 
tudes of contiguous strata and mere observations of the surface, which is a com- 
mon practice in drawing classical geological sections of the Earth and other planets 
of the Solar System, has been applied in rare locations and is identified by grey lines 
in Fig. 4 and Extended Data Figs 7-9. 
Gravity-field vectors. The local gravity-field vectors of the entire comet nucleus 
and of the two separated lobes have been computed by modelling the bodies as 
polyhedrons with triangular facets. The gravity potential is computed for each 
shape model, following the theory of Werner and Scheeres, adding up the contribution 
from all the facets of the polyhedron. The centrifugal force due to the 
comet rotation is included in the final computation of the force field. 
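
The closed-form polyhedron solution of Werner and Scheeres is not reproduced here. As a much coarser stand-in, the sketch below replaces each tetrahedron formed by the coordinate origin and one triangular facet with a point mass at its centroid, sums the attractions, and adds the centrifugal term. It assumes a uniform density, a closed mesh with outward-oriented facets, coordinates in metres and a spin vector in radians per second; the function name and arguments are hypothetical.

```python
import numpy as np

G = 6.674e-11  # gravitational constant, m^3 kg^-1 s^-2

def point_mass_gravity(vertices, faces, density, field_point, omega=None):
    """Approximate acceleration at field_point from a closed triangular mesh.

    Each facet and the origin form a tetrahedron, which is replaced by a
    point mass at its centroid (a simplification, not the exact polyhedron
    potential). If omega is given, the centrifugal acceleration of the
    rotating frame is added.
    """
    v = np.asarray(vertices, dtype=float)
    tri = v[np.asarray(faces)]                       # shape (n_faces, 3, 3)
    # Signed volume of each tetrahedron (origin, a, b, c).
    vol = np.einsum("ij,ij->i", tri[:, 0], np.cross(tri[:, 1], tri[:, 2])) / 6.0
    centroids = tri.sum(axis=1) / 4.0                # fourth vertex is the origin
    p = np.asarray(field_point, dtype=float)
    r = centroids - p                                # vectors towards each mass
    dist = np.linalg.norm(r, axis=1)
    acc = G * density * np.sum((vol / dist**3)[:, None] * r, axis=0)
    if omega is not None:
        w = np.asarray(omega, dtype=float)
        # Centrifugal acceleration: -w x (w x p), directed away from the spin axis.
        acc -= np.cross(w, np.cross(w, p))
    return acc
```

Evaluating this function once on a whole-nucleus mesh and once on a mesh of a single lobe gives the two kinds of local gravity vector that are compared with the strata normals, although only at the accuracy of the point-mass approximation.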
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Extended Data Figure 1 | Seth region. a, b, General view (a) and interpreted 
image (b) with the main terraces (pale green), and strata margins (red dashed 
lines) of the Seth region. c, Stereographic projection (north-polar aspect) of the 
main lobe, displaying heights derived from a stereo-photogrammetric shape 
model. Colour scale indicates height above a sphere of radius 1.5 km centred 
in the main lobe. Terraces are aligned along different levels of the same 
elevation. d, NAC view of landing site candidate A (see location in panels 
a-c). Terraces in the upper wall of site A (small white arrows) together with 
parallel lineaments (large white arrows), define a continuous stratification. 
Yellow arrows indicate other terraces of the Seth region. 
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Extended Data Figure 2 | Imhotep regions and surroundings. a, NAC 
overview of the Ash, Apis, Khepry and Imhotep regions; in the background 
are regions of the head. The red dashed line separates the head from the 
main body. The white square is the location of c. The eye symbol shows the view 
of d. b, Map of the terrace, cuesta and mesa margins (dashed red lines) showing 
the general attitudes of strata in the main body. c, Close view of the stratification 
within Ash, where thinner strata are only a few metres thick. d, Imhotep terrace 
margins and inner stratification. Strata of site B dip underneath smooth 
deposits, showing their exhumed nature. e, Metre-to-centimetre scale strata 
heads within Imhotep. 


Extended Data Figure 3 | The Babi and Aten regions. a, NAC overview of 
Babi region and surroundings. White squares are the locations of b and 
d. b, Detailed NAC view of Babi stratification: white arrows indicate small 
terraces and strata. c, The Babi–Aten boundary is underlined by staircase 
terraces, indicated by black arrows. d, Detailed view of stratification in Aker: 
white arrows show strata cut by fractures. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Figure 4 | The Hatmehit region. a, NAC mosaic of the 
Hatmehit region and surroundings. The red dashed lines show the cuesta-like 
margins in Ma’at and Maftet. White arrows indicate the terrace step within 
Hatmehit. White squares are the locations of b and c. The apparently 
centripetal morphologies all around Hatmehit are due to stratified scarps, 
highlighted by the red dashed lines in b and c. The terrace step within Hatmehit 
(white arrows in d and e) and the terraces at its margin (black arrows in 
e) are visible in the stereo-photoclinometry Digital Terrain Model (d) and 
the related perspective view (e). 
Extended Data Figure 5 | Standard deviations of best-fitting planes. 
Distribution of the ratios between the standard deviations of the 3D coordinates 
in the local reference system (where the z axis is normal to the best-fitting 
plane). N is the number of best-fitting planes; σz is the standard deviation along 
the z axis of the points used to retrieve a given plane; σx and σy are the standard 
deviations along the other two perpendicular axes. The mean ratio is 0.0574, 
which corresponds to a maximum error on the dip angle and dip direction 
of the best-fitting planes lower than 2°. 
Extended Data Figure 6 | 3D views of the nucleus and strata of comet 
67P/Churyumov-Gerasimenko. On the left side are four different 3D views of 
the nucleus of comet 67P/Churyumov-Gerasimenko with best-fitting planes 
derived from the stratification. On the right side are the best-fitting planes 
alone. Each plane indicates the orientation of strata at that specific location 
(corresponding to the centre of the drawn plane) on the comet nucleus. Note 
how the two lobes show independent envelopes. The colour scale indicates the 
angular deviation between the plane vector and the local gravity vector 
(calculated for the whole body, assuming uniform density). 
Extended Data Figure 7 | Eastern sector cross-sectional slices through 
comet 67P/Churyumov-Gerasimenko along its major axis (longitudinal 
sections). On the left are perspective views of the comet nucleus showing the 
different cross-sectional slices. In red are shown the best-fitting planes 
projected into the geological section; in blue all the other best-fitting planes. On 
the right are geological sections. In blue are shown the best-fitting planes; green 
arrows are vectors perpendicular to each best-fitting plane; yellow lines show 
the field of lines used for drawing strata within the comet nucleus (strata 
are perpendicular to the yellow lines); dashed black lines are the strata departing 
from measurements at the surface of each longitudinal section. The traces 
of transversal sections A, B, C and D and the locations of some regions are 
also reported. 
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Extended Data Figure 8 | Western sector cross-sectional slices through the 
comet along its major axis (longitudinal sections). On the left are perspective 
views of the comet nucleus showing the different cross-sectional slices. In 
red are shown the best-fitting planes projected into the geological section; 
in blue are shown all the other best-fitting planes. On the right are geological 
sections. In blue are shown the best-fitting planes; green arrows are vectors 
perpendicular to each best-fitting plane; yellow lines show the field of lines used 
for drawing strata within the comet nucleus (strata are perpendicular to the 
yellow lines); dashed black lines are strata departing from measurements 
at the surface of each longitudinal section; dashed grey lines are inferred strata. 
The traces of transversal sections A, B, C and D and the locations of some 
regions are also reported. 
Extended Data Figure 9 | Cross-sectional slices through the comet 
perpendicular to the major axis (transversal sections). On the left are 
perspective views of the comet nucleus showing the different cross-sectional 
slices. In red are shown the best-fitting planes projected into the geological 
section; in blue are shown all the other best-fitting planes. On the right are 
geological sections. In blue are shown the best-fitting planes; green arrows are 
vectors perpendicular to each best-fitting plane; yellow lines show the field of 
lines used for drawing strata within the comet nucleus (strata are perpendicular 
to the yellow lines); dashed black lines are strata departing from measurements 
at the surface of each section and from the intercept of strata of the 
longitudinal sections; dashed grey lines are inferred strata. The traces 
of longitudinal sections 1, 2, 3, 4, 5 and 6 and the locations of some regions are 
also reported. 
Extended Data Table 1 | ID numbers for the OSIRIS images

Figure           OSIRIS images
1a               NAC_2014-08-06T02.20.01_F28
1b               NAC_2014-09-22T01.48.46_F41
1c               NAC_2014-09-05T02.47.24_F71
1d               WAC_2014-09-18T05.23.28_F17
1e               NAC_2015-03-10T20.05.49_F22
2a               NAC_2014-08-07T20.37.34_F22
2b               NAC_2014-09-13T08.19.08_F22
2c               WAC_2014-09-12T18.51.33_F17
2d               NAC_2014-09-19T12.41.05_F41

Extended Data    OSIRIS images
1a,b             NAC_2014-08-08T03.37.34_F22
1d               NAC_2014-09-19T13.16.13_F22
2a,b             NAC_2014-08-16T14.59.14_F22
2c               NAC_2014-09-20T19.01.50_F22
2d               NAC_2015-03-17T03.37.46_F41
2e               NAC_2015-02-14T12.36.35_F88
3a               NAC_2014-08-25T12.42.54_F22
3b               NAC_2014-09-13T01.02.49_F22
3c               NAC_2014-08-16T17.59.14_F22
3d               NAC_2014-09-05T09.35.5_F22
4a,b,c           Mosaic: NAC_2014-09-14T17.47.55_F22, NAC_2014-09-14T17.32.15_F41,
                 NAC_2014-09-14T17.31.24_F22, NAC_2014-09-15T05.43.04_F41,
                 NAC_2014-09-14T05.05.28.432_F22

The first three letters indicate the instrument used to acquire the image (NAC or WAC);
the following digits are the time (in Coordinated Universal Time, UTC) of imaging
(year-month-day, then hour-minute-seconds); the last two numbers correspond to the filters.
Observation of the competitive double-gamma nuclear decay

C. Walz, H. Scheit, N. Pietralla, T. Aumann, R. Lefol & V. Yu. Ponomarev


The double-gamma (γγ)-decay of a quantum system in an excited
state is a fundamental second-order process of quantum electrodynamics.
In contrast to the well-known single-gamma (γ)-decay,
the γγ-decay is characterized by the simultaneous emission of two
quanta, each with a continuous energy spectrum. In nuclear physics,
this exotic decay mode has only been observed for transitions
between states with spin-parity quantum numbers J^π = 0⁺ (refs 1-3).
Single-gamma decays, the main experimental obstacle to observing
the γγ-decay, are strictly forbidden for these 0⁺ → 0⁺ transitions.
Here we report the observation of the γγ-decay of an excited
nuclear state (J^π = 11/2⁻) that is directly competing with an
allowed γ-decay (to ground state J^π = 3/2⁺). The branching ratio
of the competitive γγ-decay of the 11/2⁻ isomer of ¹³⁷Ba to the
ground state relative to its single γ-decay was determined to be
(2.05 ± 0.37) × 10⁻⁶. From the measured angular correlation
and the shape of the energy spectra of the individual γ-rays, the
contributing combinations of multipolarities of the γ radiation
were determined. Transition matrix elements calculated using the
quasiparticle-phonon model reproduce our measurements well.
The γγ-decay rate gives access to so far unexplored important nuclear
structure information, such as the generalized (off-diagonal)
nuclear electric polarizabilities and magnetic susceptibilities.

Two-photon processes, that is, excitation and decay of a quantum
state simultaneously involving two photons whose energy sum E1 + E2
matches the transition energy E0, have been studied intensely in the
past two decades in atomic physics and are now routinely applied in
spectroscopic studies. These processes were first discussed theoretically
in 1929, and the same approach predicted the existence of
the rare double-β-decay of atomic nuclei as the analogous second-order
process for the electroweak interaction. In contrast to the situation
in atomic physics, data on the γγ-decay of atomic nuclei are very
sparse. Up to now, γγ-decays of nuclear states have only been observed
for the special cases of the first excited states of the doubly-magic
nuclei ¹⁶O, ⁴⁰Ca and ⁹⁰Zr (refs 1-3, 9). The first excited states of these
nuclei have the unusual property of having spin-parity quantum numbers
J^π = 0⁺, which is also the case for their ground states. Hence,
γ-decays are strictly forbidden for these transitions. For almost all
other nuclei, the first excited states have spin quantum numbers larger
than zero, making γ-decays possible. Despite several searches, the
γγ-decay has never been observed in a situation where a single γ-ray
transition is allowed, a situation we call the competitive γγ-decay,
denoted by 'γγ/γ-decay'. Preliminary evidence for the γγ/γ-process
obtained by other groups in parallel with our work has been
announced elsewhere (ref. 12, and D. J. Millener and R. J. Sutter
(personal communication); ref. 13, and C. J. Lister (personal
communication)). In the present work, the experimental difficulties have been
overcome and we report here firm observation of the γγ/γ-decay of a
nuclear state.

The experimental challenge to the observation of the competitive
γγ-decay arises from its decay rate, which is more than five orders of
magnitude smaller than that of the allowed γ-decay. In an ensemble of
nuclei undergoing γ-decay from a given excited state, there are a large
number of γ quanta being emitted, which cause two basic challenges,
illustrated in Fig. 1. First, a γ quantum with transition energy E0 can
deposit part of its energy in one detector and then scatter into the
second detector, depositing there its remaining energy. The sum of
the energies registered in both detectors equals the transition energy,
which is the signature of the γγ-decay. Second, two γ quanta, each
with energy E0, can be emitted independently by two nuclei at
almost the same time, and deposit a fraction of their energy in two
detectors, such that the energy sum is again close to E0. Although these
random coincidences can be subtracted from truly simultaneous
events, they cause a substantial statistical uncertainty, thereby preventing
the observation of the much rarer γγ/γ-decay unless they are
suppressed properly.

In our experiment, the first excited 11/2⁻ state of ¹³⁷Ba was populated
through β⁻ decays of ¹³⁷Cs nuclei, a well known γ-ray calibration
standard. This isomer decays dominantly via a single-γ magnetic
hexadecapole (M4) transition to the 3/2⁺ ground state, emitting a γ-ray
with an energy of 661.66 keV (Fig. 2a inset). The emitted γ-rays were
detected using five large-volume LaBr₃:Ce (Ce-activated LaBr₃) detectors
positioned in a planar fashion symmetrically around the ¹³⁷Cs
γ-ray source (Fig. 1a). Thus, the angle between two adjacent detectors
amounted to 72°, and two groups of five detector pairs can be formed
with relative angles of 72° ('72°-group') and 144° ('144°-group'). The
undesired Compton scattering of 661.66-keV γ-rays between two
detectors was suppressed by thick lead shields. Data were recorded
for a measurement time of 52.7 days.
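
The grouping of detector pairs described above can be checked by enumeration. The following is a minimal sketch, assuming five detectors spaced evenly on a ring and numbered 0 to 4 (the numbering and the helper name `opening_angle` are illustrative, not taken from the experiment):

```python
from itertools import combinations

N_DETECTORS = 5
STEP_DEG = 360 / N_DETECTORS  # 72 degrees between neighbouring detectors


def opening_angle(i, j):
    """Smallest angle in degrees between detectors i and j on the ring."""
    diff = (abs(i - j) * STEP_DEG) % 360
    return min(diff, 360 - diff)


# Group every unordered detector pair by its opening angle.
groups = {}
for i, j in combinations(range(N_DETECTORS), 2):
    groups.setdefault(round(opening_angle(i, j)), []).append((i, j))

for angle, pairs in sorted(groups.items()):
    print(f"{angle} deg: {len(pairs)} pairs {pairs}")
```

The ten possible pairs split into exactly two groups of five, which is the division into the 72°-group and the 144°-group used in the analysis.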

The spectrum of the time-difference (Δt) between two hits in two
detectors of the 72°-group is presented in Fig. 1b inset. The prompt
time-coincidence peak centred at Δt = 0 sits on a flat background
caused by random coincidences. The time condition |Δt| ≤ 1.2 ns
selects true (prompt) as well as random coincidences, resulting in
the energy-sum spectrum E1 + E2 shown in Fig. 1b (orange filled
circles). The contribution of random coincidences is determined by
imposing a wide time gate on the flat background and scaling the
corresponding energy-sum spectrum with a factor of 0.0214 to correct
for the different widths of both conditions (Fig. 1b, green filled
triangles). The random coincidences dominate the statistical uncertainty.
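
The scaling factor of 0.0214 can be reproduced from the gate widths. The sketch below assumes that the wide random gate is the window 20 ns < |Δt| < 76 ns quoted in the Fig. 1 caption and that the prompt gate spans |Δt| up to 1.2 ns on both sides of zero; `subtract_randoms` is a hypothetical helper showing how such a factor would be applied bin by bin, not the analysis code used for the paper.

```python
PROMPT_HALF_WIDTH_NS = 1.2      # prompt gate: |dt| up to 1.2 ns
RANDOM_GATE_NS = (20.0, 76.0)   # random gate: 20 ns < |dt| < 76 ns (assumed from the Fig. 1 caption)

# Both gates are symmetric in dt, so each covers twice its one-sided width.
prompt_width = 2 * PROMPT_HALF_WIDTH_NS
random_width = 2 * (RANDOM_GATE_NS[1] - RANDOM_GATE_NS[0])
scale = prompt_width / random_width
print(f"scale factor = {scale:.4f}")


def subtract_randoms(prompt_counts, random_counts, factor):
    """Subtract the scaled random-coincidence spectrum from the prompt spectrum, bin by bin."""
    return [p - factor * r for p, r in zip(prompt_counts, random_counts)]
```

With these gate widths the ratio is 2.4 ns / 112 ns, which rounds to the quoted 0.0214.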

The final energy-sum spectrum of the 72°-group, after subtraction
of random coincidences, is shown in Fig. 2a. We imposed the energy
condition |E1 − E2| < 300 keV to obtain further background suppression
(see Supplementary Information). The spectrum is well described
by assuming a superposition of an exponential background and a
Gaussian peak at 661.6(1.6) keV in agreement with the expected
energy sum (here and elsewhere, numerals in parentheses indicate
the 1σ uncertainty). It has an area of 693(95) counts, establishing
the observation of the competitive γγ-decay.

Figure 1 | The two main experimental obstacles to measuring the γγ/γ-decay.
a, Schematic illustration of the experimental set-up. A 661.66-keV γ-ray
(green wavy line) emitted from the central source (black dot) can deposit
part of its energy in detector 1 (labelled red square) and then be scattered into
detector 2 where it deposits its remaining energy. Lead (grey) is used to
suppress this effect relative to the γγ-events (blue wavy lines). Lead collimators
(dark grey) are used for background suppression. b, Energy-sum spectra of the
72°-group with time gates on the prompt coincidence peak (orange filled
circles) and on the random coincidences (green filled triangles;
20 ns < |Δt| < 76 ns). The ordinate values should be multiplied by the given
factor. Inset, time difference spectrum (red trace) and parts of the imposed
time gates (orange bar (prompt) and green areas (random)) used to obtain the
energy-sum spectra of b. The energy condition |E1 − E2| < 300 keV was
imposed on data reported in b as well as on that shown in the inset. Error bars,
±1 s.d.

Reasonable explanations other than γγ/γ-decay can be excluded.
First, let us assume that despite our careful assembly of the lead shields,
the peak is caused by Compton-scattered 661.66-keV γ-rays (due, for
example, to a small hole in the absorptive lead shields). The additional
flight path of the γ quantum from one detector to the other would
cause a time delay to which the LaBr₃:Ce detectors are sensitive. Hence,
this possibility can be excluded by investigating the shape of the
time-difference spectra. A narrow energy gate on the γγ/γ-sum-energy
peak results in the time-difference spectrum presented in Fig. 2b
(orange filled circles). The contribution of the background counts
(solid green curve) is determined with energy gates on the background.
The time-difference spectrum follows a Gaussian distribution centred
at Δt = 0 (solid orange curve) with a full-width at half-maximum
that is in agreement with the time resolution of our LaBr₃:Ce
detectors, namely 1 ns. In the case of Compton scattering, a superposition
of two Gaussians with centroids at Δt = ±0.8 ns (solid red curve
in Fig. 2b) would be expected, in contradiction to the observed
time spectrum.

A second possibility for the origin of the peak is a sequential decay
via the 1/2⁺ state at an excitation energy of 283.5 keV (Fig. 2a inset),
corresponding to two subsequent γ-decays. Our observed γ-ray
energy spectra of individual photons, gated on γγ/γ-events, are
continuous and do not peak at the transition energies to and from
the 1/2⁺ state, thereby excluding sequential decay. In addition, the
γ-branch to the 1/2⁺ state has been measured recently and amounts
to only 1.12(9) × 10⁻⁷ (ref. 14), less than 6% of our measured
γγ/γ-branching ratio.
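
Both exclusion arguments rest on simple numbers that can be checked by hand. The sketch below is a back-of-envelope check only. It assumes the scattered photon travels at the speed of light between detectors, and it takes the cascade branch as 1.12 × 10⁻⁷, the exponent that is consistent with the statement that it is less than 6% of the γγ/γ-branching ratio of 2.05 × 10⁻⁶. The actual detector spacing is not given in this section.

```python
C_M_PER_NS = 0.299792458  # speed of light in metres per nanosecond

# 1) Compton scattering: extra flight path implied by a 0.8 ns delay.
delay_ns = 0.8
path_m = C_M_PER_NS * delay_ns
print(f"extra flight path for a {delay_ns} ns delay: {path_m * 100:.0f} cm")

# 2) Sequential decay: size of the cascade branch relative to the measured signal.
cascade_branch = 1.12e-7  # gamma branch via the 1/2+ state (ref. 14), exponent assumed
gg_branch = 2.05e-6       # measured gamma-gamma/gamma branching ratio
fraction = cascade_branch / gg_branch
print(f"cascade / gamma-gamma = {fraction:.1%}")
```

A delay of 0.8 ns corresponds to a path of roughly 24 cm, a separation that detectors with 1 ns time resolution can resolve as a shifted centroid, and the cascade accounts for about 5.5% of the observed branching ratio.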

In addition, data were recorded for the detector group forming
opening angles of 144°. A Gaussian peak is found in the corresponding
energy-sum spectrum at an energy of 664.2(28) keV and with an area
of 307(78) counts (Fig. 3). This shows that the γγ/γ-decay of the 11/2⁻
isomer of ¹³⁷Ba exhibits a very pronounced angular correlation of the
two emitted γ quanta, characteristic of the multipolarities involved in
the transition.
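
The quoted peak parameters for the two detector groups can be compared with the transition energy directly. This is a minimal sketch using only the numbers given in the text. The ratio of area to uncertainty is shown as a rough indicator and should not be read as a formal significance, which would require the full fit.

```python
E0_KEV = 661.66  # transition energy

peaks = {
    "72 deg":  {"centroid": 661.6, "centroid_err": 1.6, "area": 693, "area_err": 95},
    "144 deg": {"centroid": 664.2, "centroid_err": 2.8, "area": 307, "area_err": 78},
}

results = {}
for name, p in peaks.items():
    # Deviation of the fitted centroid from E0 in units of its 1-sigma uncertainty.
    pull = (p["centroid"] - E0_KEV) / p["centroid_err"]
    # Peak area divided by its uncertainty.
    ratio = p["area"] / p["area_err"]
    results[name] = (pull, ratio)
    print(f"{name}: centroid pull = {pull:+.2f} sigma, area/uncertainty = {ratio:.1f}")

# Ratio of the peak areas in the two groups, which reflects the angular correlation.
area_ratio = peaks["72 deg"]["area"] / peaks["144 deg"]["area"]
print(f"area(72 deg) / area(144 deg) = {area_ratio:.2f}")
```

Both centroids lie within one standard deviation of 661.66 keV, and the 72°-group contains a little more than twice as many counts as the 144°-group.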

Figure 2 | Energy-sum spectrum and energy-gated time spectra of the 72°-group.
a, Energy-sum spectrum E1 + E2 after subtraction of the random
coincidences (requiring the energy condition |E1 − E2| < 300 keV). The
spectrum is fitted with a superposition of a Gaussian and an exponential
function to describe the peak and the background, respectively. The orange
(647 keV < E1 + E2 < 677 keV) and green areas (582 keV < E1 + E2 < 632 keV
and 692 keV < E1 + E2 < 742 keV) represent the energy conditions employed
to obtain the time-difference spectra displayed in b. Inset, level scheme of
¹³⁷Ba with level energies in keV on the left and spin-parity J^π quantum
numbers on the right. The solid red arrow labels the single-γ transition of
the 11/2⁻ state to the ground state. The dashed blue arrows illustrate the
structure of the matrix element of the γγ/γ-decay involving the lowest 7/2⁺
state. b, Time difference spectra. The orange data points correspond to the
orange region in a, while the green solid line with surrounding shading shows
the background corresponding to the green area in a. The solid orange curve
shows the expected time spectrum for γγ-decay, while the solid red curve
shows the expected time spectrum, assuming the peak at 661.66 keV was
caused by Compton-scattered γ-rays. Error bars in a, b are ±1 s.d.

Figure 3 | Energy-sum spectrum of the 144°-group. The energy-sum
spectrum E1 + E2 of the 144°-group after subtraction of the random
coincidences (with the energy condition |E1 − E2| < 250 keV) is shown by the
red data points. The Gaussian (solid blue curve) on top of an exponential
background (dashed blue curve) with a peak area of 307(78) counts is located at
an energy of 664.2(28) keV, in agreement with the transition energy of
661.66 keV. Error bars, ±1 s.d.

In the theoretical treatment of the γγ-decay process, it follows
from second-order perturbation theory that the calculation of the
γγ-decay transition rate involves a summation over a complete set
of states, subject to well known electromagnetic selection rules based
on the multipolarity and character of the transition. In a reasonable
approximation, the possible decay paths can be restricted to two
cases (see Supplementary Information for a justification of this
approximation), which we will describe here in the language of
'Feynman diagrams': in the first case, the nucleus undergoes virtual M2
transitions from the 11/2⁻ state to intermediate 7/2⁺ states (Fig. 2a
inset), which are coupled by E2 transitions to the 3/2⁺ ground state.
The final matrix element is obtained by a summation over all
intermediate 7/2⁺ states of ¹³⁷Ba, whose contributions add up coherently.
The second possibility is an E3M1 transition through intermediate
5/2⁺ states. The theory of the γγ-decay is fully developed (see, for
example, the Appendix of ref. 3; a first application of the formalism
to ¹³⁷Ba was performed by D. J. Millener (personal communication)),
and the differential branching ratio for the current case is
approximately given by
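$$\frac{\mathrm{d}^{5}\Gamma_{\gamma\gamma}}{\mathrm{d}\omega\,\mathrm{d}\Omega\,\mathrm{d}\Omega'} \approx \ldots \qquad (1)$$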
where Γγγ is the decay width for two-photon emission, and s stands for
{L, L′, I_n, ω, θ12}; here L and L′ label the multipoles of the two virtual
transitions involved, I_n is the spin of the intermediate state n, ω is the
energy of one of the γ-rays, θ12 denotes the angle between the two
emitted γ-rays (that is, in our experiment θ12 = 72° and 144°) and dΩ
and dΩ′ are their solid angle elements. The coefficients α_E2M2 and
α_M1E3 contain the nuclear structure information and are defined as

$$\alpha_{S'L'SL} = \sum_{n} \frac{\langle 3/2^{+} \| S'L' \| I_n \rangle \, \langle I_n \| SL \| 11/2^{-} \rangle}{E_n - 0.5\,E_0} \qquad (2)$$


where S and S′ are the multipole characters. Each intermediate
state |I_n⟩ contributes with a product of matrix elements connecting
it to the ground state (⟨3/2⁺‖S′L′‖I_n⟩) and to the 11/2⁻ isomer
(⟨I_n‖SL‖11/2⁻⟩) weighted approximately with the inverse of its
excitation energy E_n. The quadrupole-quadrupole (A_qq) and
octupole-dipole (A_od) terms of equation (1) depend only on α²_E2M2 and
α²_M1E3, respectively, while the interference term A_x depends on
their product. Each term has a characteristic dependence on θ12 and
the γ-ray energy ω. Thus, the values of α_E2M2, α_M1E3 and the sign of
α_E2M2 α_M1E3 can be determined from an analysis of the measured
angular correlation and the energy spectra of the emitted γ-rays.
More information and the full form of equation (1) are given in
Supplementary Information.

Figure 4 | Measured energy spectrum and angular correlation. δ (δ′) is the
differential γγ-decay width from equation (1) integrated over the
corresponding energy bin, divided by the γ-decay width Γγ and evaluated at the
average emission angles θ12 = 72° and 144° (see Supplementary Information
equation (2) for details of δ and δ′). a, Data points, dependence of δ on the
energy E1 of the lower-energy γ-ray for θ12 = 72°. Also shown are the results of
the fit of equation (1) (solid blue curve) and the contributions of the A_qq
(dashed blue curve), A_od (blue dotted curve) and A_x terms (blue dash-dotted
curve). Results of the QPM calculation (orange solid curve) are compared to the
data. b, Data points, dependence of δ′ on θ12 integrated over the energy bin
|E1 − E2| < 250 keV. Other curves as in a. All values are corrected for the small
contribution of the cascade of the decay via the 1/2⁺ state. Error bars, ±1 s.d.

In Fig. 4, the result of a simultaneous fit to the energy spectrum
(Fig. 4a) and angular correlation (Fig. 4b) is presented. The available
data are well described by equation (1) (solid blue curve), and the
contribution of the A_qq term (dashed blue curve) dominates in
comparison to the A_od term (dotted blue curve). The angular correlation is
very sensitive to the product of α_E2M2 and α_M1E3. The final branching
ratio amounts to Γγγ/Γγ = 2.05(37) × 10⁻⁶, and the values for the two
matrix elements are given in Table 1.

In Fig. 4 and Table 1 our data are also compared to predictions of the
microscopic quasiparticle-phonon model (QPM) (solid orange
curves). Further details of the QPM calculation are given in
Supplementary Information. The QPM reproduces the experimental
branching ratio and the two matrix elements rather well. As
experimentally observed, it predicts a dominant A_qq term and a positive sign
for α_E2M2 α_M1E3. According to the QPM results, the value of α_E2M2 is
dominated by the matrix element coupled to the lowest 7/2⁺ state at an
excitation energy of 1,252 keV (Supplementary Information). The reason
is twofold: the lowest 7/2⁺ is close in energy to the 11/2⁻ state,
reducing the size of the denominator in equation (2), and almost all of
the low-lying E2-strength is concentrated in the 3/2⁺ → 7/2⁺₁
transition, resulting in a large ⟨3/2⁺‖E2‖7/2⁺₁⟩ matrix element.
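
The level of agreement between experiment and the QPM can be made explicit from the values in Table 1. The sketch below uses the names `alpha_E2M2` and `alpha_M1E3` for the two coefficients and divides the difference by the experimental uncertainty only. The QPM values carry no quoted uncertainty, so these numbers are a rough guide and not significance tests.

```python
# (experimental value, experimental 1-sigma uncertainty, QPM prediction) from Table 1
table1 = {
    "branching ratio (1e-6)": (2.05, 0.37, 2.69),
    "alpha_E2M2": (33.9, 2.8, 42.60),
    "alpha_M1E3": (10.1, 4.2, 9.50),
}

comparison = {}
for name, (exp, err, qpm) in table1.items():
    ratio = qpm / exp              # theory divided by experiment
    deviation = (qpm - exp) / err  # difference in units of the experimental uncertainty
    comparison[name] = (ratio, deviation)
    print(f"{name}: QPM/exp = {ratio:.2f}, (QPM - exp)/sigma_exp = {deviation:+.1f}")
```

The predicted branching ratio and α_E2M2 are both about 25 to 30% above the measured values, while α_M1E3 agrees within the experimental uncertainty.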

Table 1 | Measured and theoretical parameter values

Parameter                  Experiment    QPM theory
Γγγ/Γγ (10⁻⁶)              2.05(37)      2.69
α_E2M2 (e² fm⁴ MeV⁻¹)      +33.9(2.8)    +42.60
α_M1E3 (e² fm⁴ MeV⁻¹)      +10.1(4.2)    +9.50

See text for details of parameters. The uncertainties include the statistical error
from the fit (±1 s.d.) and systematic contributions.

In conclusion, we have observed the γγ-decay of a nuclear transition
in competition with an allowed γ-decay. Our results demonstrate that
theory provides a realistic description of the double-photon decay, and
that the present state of the art of experimental equipment allows such
a process to be measured even in the presence of the direct one-photon
decay. This opens a new field of studies that were not previously
possible. While the branching ratio is probably too small for the
observation of the γγ-decay of a first excited 2⁺ state, which is present in
nearly all even-even nuclei, we expect that there are other odd-A nuclei
where the competitive γγ-decay can be measured. Such experiments
will be difficult and time-consuming, and the number of suitable
nuclides obtainable in an isomeric state is limited. Nevertheless, the
sensitivity of the experiment is now sufficient to enable investigation of
so-far-unexplored nuclear structure observables, namely the
(off-diagonal) nuclear electric and magnetic polarizabilities.
Quantum computation requires qubits that can be coupled in a
scalable manner, together with universal and high-fidelity one- and
two-qubit logic gates. Many physical realizations of qubits exist,
including single photons, trapped ions, superconducting circuits,
single defects or atoms in diamond and silicon, and semiconductor
quantum dots, with single-qubit fidelities that exceed
the stringent thresholds required for fault-tolerant quantum
computing. Despite this, high-fidelity two-qubit gates in the solid
state that can be manufactured using standard lithographic
techniques have so far been limited to superconducting qubits, owing
to the difficulties of coupling qubits and dephasing in
semiconductor systems. Here we present a two-qubit logic gate,
which uses single spins in isotopically enriched silicon and is
realized by performing single- and two-qubit operations in a
quantum dot system using the exchange interaction, as envisaged
in the Loss-DiVincenzo proposal. We realize CNOT gates via
controlled-phase operations combined with single-qubit operations.
Direct gate-voltage control provides single-qubit addressability,
together with a switchable exchange interaction that is
used in the two-qubit controlled-phase gate. By independently
reading out both qubits, we measure clear anticorrelations in the
two-spin probabilities of the CNOT gate.

Quantum dots have long been considered an attractive physical platform
for quantum information processing. Large arrays can be
conveniently realized using conventional lithographic approaches.
Initialization, read-out, control and coupling can be achieved through
local electrical pulses, possibly in combination with magnetic resonance
techniques. Early research focused mainly on III-V semiconductor
compounds such as GaAs, resulting in single-spin qubits, singlet-triplet
qubits and exchange-only qubits, which can be coupled capacitively
or via the exchange interaction. Although these approaches
demonstrate the potential of quantum-dot qubits, strong dephasing due to
the nuclear spin background has limited the quality of the quantum
operations. A marked improvement in coherence times has been
observed by defining the quantum dots in silicon, which can be
isotopically purified, such that quantum dots with single-spin fidelities
above the threshold of surface codes can be realized.

A scalable approach towards quantum computation ideally requires
that the coupling between qubits can be turned on and off, so that
single- and two-qubit operations can be selectively chosen. Here, we
push silicon-based quantum information processing beyond the
single-qubit level by realizing a controlled-phase (CZ) gate, which is
commonly used in superconducting qubits and has been theoretically
discussed for quantum-dot systems. This two-qubit gate, together with
single-qubit gates, provides all of the necessary operations for universal
quantum computation. In our system, each qubit is defined by the spin
state of a single electron, with energies split by a large magnetic field of
strength B0 = 1.4 T. The single-qubit states are manipulated using
spin-resonance techniques, through the local application of an oscillating
magnetic field produced by an on-chip electron spin resonance (ESR)
line. By exploiting the Stark shift, we electrically control the effective
g-factor of the qubits, to tune the Zeeman energy Ez = gμB B0 and the
associated qubit resonance frequency ν = Ez/h for selective qubit
control, where μB is the Bohr magneton and h is the Planck constant. The
two-qubit gate is then realized using electrical pulses that control the
exchange coupling between the qubits.
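
The relation ν = gμB B0/h fixes the scale of the qubit resonance frequency. The following is a minimal sketch that assumes g = 2 (close to the free-electron value and a reasonable stand-in for electrons in silicon, but not a number quoted in the text) and uses an illustrative g-factor change of 10⁻³ to show how a small electrical shift in g moves the resonance.

```python
MU_B = 9.2740100783e-24  # Bohr magneton, J/T
H_PLANCK = 6.62607015e-34  # Planck constant, J s


def resonance_frequency_hz(g, b0_tesla):
    """Qubit resonance frequency nu = g * mu_B * B0 / h."""
    return g * MU_B * b0_tesla / H_PLANCK


B0 = 1.4          # tesla, as quoted in the text
G_ASSUMED = 2.0   # assumption: g-factor of about 2

nu = resonance_frequency_hz(G_ASSUMED, B0)
print(f"resonance frequency: {nu / 1e9:.2f} GHz")

# Frequency shift produced by a small change in g (illustrative value only).
delta_g = 1e-3
shift = resonance_frequency_hz(delta_g, B0)
print(f"shift for delta g = {delta_g}: {shift / 1e6:.1f} MHz")
```

At 1.4 T the resonance lies near 39 GHz, so a relative change in g of order 10⁻³ already separates two qubits by tens of megahertz.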

Figure 1a shows a schematic and Fig. 1b shows a scanning electron
microscope image of the double-quantum-dot structure fabricated on a
²⁸Si epilayer with a residual ²⁹Si concentration of 800 p.p.m. (ref. 14).
The device consists of three aluminium layers, nine aluminium gates, an
aluminium lead for ESR control, and source, drain and reservoir leads
that connect to a gate-induced two-dimensional electron gas using
multilevel gate-stack silicon metal-oxide-semiconductor technology.
A single-electron transistor (SET) is formed to monitor the charge state
of the quantum-dot system and for spin read-out using spin-to-charge
conversion. For both our single- and two-qubit experiments, we tune
the gate voltages to the appropriate dc operating regime and then adjust
gate G1 for qubit read-out, initialization and control.

Figure 1c shows the stability diagram of the double-quantum-dot
system with charge occupancy (N2, N1). The charge transitions of
quantum dots D1 and D2, which are underneath gates G1 and G2,
respectively, are distinguished by their gate voltage dependence and
their capacitive coupling to the SET. We define qubit Q1 by loading a
single electron into D1, so that N1 = 1; similarly for qubit Q2, we have
N2 = 1.

To characterize the individual qubits, we bias the gate voltages such
that the tunnel time of the respective qubit to the reservoir is
approximately 100 µs and both qubits are measured in the (1, 1) charge state.
Clear Rabi oscillations are observed as a function of microwave pulse
time for both qubits, as shown in Fig. 1d. Q1 has a dephasing time of
T2* = 120 µs, with a coherence time T2 that can be extended up to 28 ms
using CPMG (Carr-Purcell-Meiboom-Gill) pulses, and Q2 has a
dephasing time of T2* = 61 µs (see Supplementary Information
section 3).

We couple the qubits via the exchange interaction, as discussed in
ref. 2, with an exchange coupling that is electrically controlled via the
detuning energy ε (see Fig. 2a, b). We control the system in the (1, 1)
region, and read out Q2 at the (1, 1)-(0, 1) transition and Q1 at the
(0, 1)-(0, 0) transition.

The presence of a sharp interface and large perpendicular electric
fields increases the energy of all excited states, and allows us to
consider only the lowest five energy states (see also Supplementary
Information section 2). We can consequently describe the system in
the rotating-wave approximation in the basis [Q2, Q1],
Ψ = [|↑,↑⟩, |↑,↓⟩, |↓,↑⟩, |↓,↓⟩, |0,2⟩], with the effective Hamiltonian

$$H = \begin{pmatrix}
E_Z - \nu & \Omega & \Omega & 0 & 0 \\
\Omega & \delta E_Z/2 & 0 & \Omega & t_0 \\
\Omega & 0 & -\delta E_Z/2 & \Omega & -t_0 \\
0 & \Omega & \Omega & -E_Z + \nu & 0 \\
0 & t_0 & -t_0 & 0 & U - \epsilon
\end{pmatrix} \qquad (1)$$
Figure 1 | Silicon two-qubit logic device, incorporating SET read-out and
selective qubit control. a, b, Schematic (a) and scanning electron microscope
coloured image (b) of the device. The quantum dot structure (labels GC and
G1-4) can be operated as a single or double quantum dot by appropriate biasing
of gate electrodes G1-G4, where we choose here to confine the dots D1,2
underneath gates G1,2, respectively. The confinement gate GC runs underneath
G1-G3 and confines the quantum dot on all sides except on the reservoir (R)
side. Qubit operation is achieved via an ac current Iac through the ESR line,
resulting in an ac magnetic field Bac. c, Stability diagram of the double quantum
dot obtained by monitoring the current ISET through the capacitively coupled
SET. The numbers in parentheses are the charge occupancies of D1,2: (N2, N1).


where Ez is the mean Zeeman energy, δEz is the difference in Zeeman
energy between the dots, Ω is the Rabi frequency, ν is the microwave
frequency and t0 is the tunnel coupling; for simplicity we have scaled
the system such that ħ = 1. In the experiments, we control ε by fast
pulsing only on G1. The read-out on Q2 (R2) and control (C) bias
points are depicted in Fig. 2a; we pulse close to the (1, 1)-(1, 2)
transition, which has the same energy level structure as the (1, 1)-(0, 2)
transition, shown in Fig. 2b. Single-qubit operations are realized with
Rabi frequency Ω, by matching ν to the resonance frequency of one of
the qubits. The presence of exchange coupling between the qubits
alters the Zeeman levels as shown in Fig. 2b, where the finite coupling
between the qubits causes an anticrossing between the |0, 2⟩ and |1, 1⟩
states. We experimentally map out the energy levels in the vicinity of
the anticrossing, as shown in Fig. 2c, d.
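
The structure of the effective Hamiltonian explains why the anticrossing with |0, 2⟩ gives an exchange interaction: only the singlet combination of the two antiparallel states couples to the doubly occupied state. The sketch below builds the 5 × 5 matrix of equation (1) as a plain nested list and evaluates that coupling. The function names and all numerical parameter values are illustrative assumptions in arbitrary units and are not taken from the experiment.

```python
import math


def hamiltonian(e_z, d_e_z, omega, nu, t0, u, eps):
    """Effective Hamiltonian in the basis [up-up, up-down, down-up, down-down, (0,2)]."""
    return [
        [e_z - nu, omega,     omega,      0.0,       0.0],
        [omega,    d_e_z / 2, 0.0,        omega,     t0],
        [omega,    0.0,       -d_e_z / 2, omega,     -t0],
        [0.0,      omega,     omega,      -e_z + nu, 0.0],
        [0.0,      t0,        -t0,        0.0,       u - eps],
    ]


def matrix_element(bra, h, ket):
    """Return <bra|H|ket> for real vectors."""
    n = len(h)
    return sum(bra[i] * h[i][j] * ket[j] for i in range(n) for j in range(n))


# Illustrative parameters (arbitrary units); microwave drive switched off.
T0 = 0.9
h = hamiltonian(e_z=0.0, d_e_z=0.04, omega=0.0, nu=0.0, t0=T0, u=10.0, eps=9.0)

s = 1 / math.sqrt(2)
singlet = [0.0, s, -s, 0.0, 0.0]   # (|up,down> - |down,up>) / sqrt(2)
triplet0 = [0.0, s, s, 0.0, 0.0]   # (|up,down> + |down,up>) / sqrt(2)
doubly_occupied = [0.0, 0.0, 0.0, 0.0, 1.0]

coupling_singlet = matrix_element(doubly_occupied, h, singlet)
coupling_triplet = matrix_element(doubly_occupied, h, triplet0)
print(f"singlet coupling to (0,2): {coupling_singlet:.4f}")
print(f"triplet coupling to (0,2): {coupling_triplet:.4f}")
```

The singlet couples with strength √2 t0 and the triplet does not couple at all, so detuning towards the anticrossing lowers the singlet energy relative to the triplet, which is the electrically controlled exchange splitting used for the two-qubit gate.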

In Fig. 2c, we have initialized Q1 and Q2 to spin down and, by
applying a π-pulse to Q2 (π_X,Q2), we map out the resonance frequency
of Q2 as a function of detuning. We measure exchange couplings of
more than 10 MHz, above which T2* of Q2 becomes shorter than the
π-pulse time tπ = 1.5 µs and so spin flips cannot occur.
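
The π-pulse time sets the single-qubit Rabi frequency, which in turn shows why a dephasing time shorter than tπ prevents a spin flip. This is a minimal sketch assuming a resonant square pulse, for which a π rotation takes half a Rabi period. The dephasing times are the values quoted earlier in the text for zero exchange, and the labels Q1 and Q2 follow that reading.

```python
T_PI_US = 1.5  # pi-pulse time in microseconds

# For a resonant square pulse a pi rotation is half a Rabi period.
rabi_frequency_mhz = 1 / (2 * T_PI_US)
print(f"Rabi frequency: {rabi_frequency_mhz * 1e3:.0f} kHz")

# Dephasing times quoted in the text (microseconds), measured without exchange coupling.
t2_star_us = {"Q1": 120.0, "Q2": 61.0}
pulses_within_t2_star = {name: t / T_PI_US for name, t in t2_star_us.items()}
for name, n in pulses_within_t2_star.items():
    print(f"{name}: T2* spans about {n:.0f} pi-pulse times")
```

Without exchange, T2* exceeds the pulse time by a factor of 40 or more, so the loss of spin flips at large exchange coupling indicates that dephasing has been shortened by well over an order of magnitude.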

Initialization of antiparallel spin states is possible by pulsing to the
(1, 2) and returning to the (1, 1) charge states (labelled I_AP in Fig. 2a).
In this sequence, an electron tunnels from D2 to D1 (I in Fig. 2d)
followed by an electron tunnelling from the reservoir R to D2 (II in
Fig. 2d). After returning to (1, 1), the electron from D2 tunnels back to
R (III in Fig. 2d), and one of the two electrons on D1, which are in a


[Figure 1d, horizontal axis: microwave pulse time (µs)]


The difference in distance to the SET results in different capacitive coupling, 
such that the individual dots can be easily distinguished. The tunnel coupling of 
the fourth transition (N, = 3 — 4) of D, is relatively weak, which is due to 
valley and spin filling, because there is only one state in the lowest orbital that 
can be occupied. Q1 and Q2 are realized by depleting D1 and D2 to the last
electron. d, The quantum dot qubits can be individually controlled by
electrically tuning the ESR resonance frequency using the Stark shift. Clear
Rabi oscillations for both qubits are observed. All measurements were
performed in a dilution refrigerator with base temperature T ≈ 50 mK and a dc
magnetic field of strength B0 = 1.4 T.


singlet state, tunnels to D2 (IV in Fig. 2d). With this initialization into
an antiparallel spin state, when we apply a microwave pulse on Q2, the
spin-up fraction f↑ approaches 1/2, except when the microwave
frequency matches a resonance frequency of Q2. Owing to the finite
exchange interaction, there are two resonance frequencies. The lower
frequency rotates the antiparallel state towards a combination of |↓,↓⟩
and |↓,↑⟩, where Q2 always ends up as spin down. The higher
frequency rotates the antiparallel state towards a combination of |↑,↓⟩
and |↑,↑⟩, where Q2 always ends up as spin up. The results are depicted
in Fig. 2d, which shows a decrease in f↑ at the lower branch and an
increase of f↑ at the upper branch, demonstrating an exchange spin
funnel, where both branches are visible (see Supplementary
Information section 4 for further details). The two-qubit gate is
conveniently realized using a quantum dot CZ gate (see Supplementary
Information section 6 for theoretical details). This approach allows
individual control over the qubits in the absence of interaction and
its associated noise, while using the coupling to perform two-qubit
operations with a frequency that can be much higher than the
single-qubit Rabi rotation frequency.

As described by equation (1) and depicted in Fig. 2b, changing ε
modifies the qubit resonance frequencies of Q1 and Q2 and introduces
an effective detuning frequency ν↑↓(↓↑), such that one qubit acquires a
time-integrated phase shift φ↑↓(↓↑) that depends on the Z component of
the spin state of the other qubit, and vice versa. The exchange coupling
and ν↑↓,↓↑ are maximized at the anticrossing ε = U; see Fig. 2b. When
Figure 2 | Exchange spin funnel. a, Close up of the operation regime of the 
(1, 1)-(0, 2) charge states. We lowered the R-D1 coupling so that the tunnelling 
time is approximately 100 μs, matching the qubit experiments. In this range of 
weak R-D1 coupling, the emptying and filling of D1 is hysteretic with gate 
voltage, because the mutual charging energy becomes relevant, as D1 can only 
tunnel when it aligns in energy with D2. R2 represents the read-out on Q2; IAP 
represents the antiparallel initialization. b, Schematic of the coupling between 
Q1 and Q2 using the exchange interaction at the |1, 1⟩-|0, 2⟩ transition. By 
electrically tuning the g factors of Q1 and Q2, we control the individual qubit 
resonance frequencies over 10 MHz. Here, we tune to a frequency difference of 


a CZ operation is performed such that φ↑↓ + φ↓↑ = π, the operation 
differs only by an overall phase from the basis CZ gate. This overall 
phase can be removed using single-qubit pulses or via voltage pulses 
exploiting the Stark shift. To realize a CNOT operation using the CZ 
gate, a CZ(π) rotation is performed in between two π/2-pulses on Q2 
that have a phase difference $y. 
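
The conversion of a CZ gate into a CNOT gate by two π/2-pulses on the target can be checked numerically. The following is a minimal sketch under stated assumptions, not the experimental pulse sequence: it uses an abstract |control, target⟩ basis with states 0 and 1, an ideal CZ = diag(1, 1, 1, -1), and takes the two pulses to be rotations about +y and -y (one possible choice of phase difference). The names `ry` and `cnot_from_cz` are hypothetical.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Y = np.array([[0, -1j], [1j, 0]])
Z = np.diag([1.0 + 0j, -1.0])

def ry(theta):
    """Rotation by angle theta about the y axis of the Bloch sphere."""
    return np.cos(theta / 2) * I2 - 1j * np.sin(theta / 2) * Y

# Basis order |control, target>: 00, 01, 10, 11 (assumed convention).
CZ = np.diag([1, 1, 1, -1]).astype(complex)
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=complex)

def cnot_from_cz():
    """pi/2 pulse on the target, ideal CZ, then the opposite pi/2 pulse."""
    first = np.kron(I2, ry(np.pi / 2))
    second = np.kron(I2, ry(-np.pi / 2))
    return second @ CZ @ first

U = cnot_from_cz()
# U flips the target only when the control is 1; it differs from the
# textbook CNOT by a Z rotation on the control, a single-qubit phase.
print(np.round(np.abs(U) ** 2, 6))
```

In this idealized model the sequence reproduces the CNOT truth table, and the residual sign on the control-1 branch is removed by a single-qubit Z on the control, in line with the statement that residual phases can be corrected with single-qubit operations.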
Figure 3 | Controlled phase (CZ) gate operation time. a, Spin-up fraction 
after applying a (π/2)X-pulse, a CZ operation and a (π/2)Y-pulse, for increasing 
qubit interaction. The exchange coupling is controlled via ε, set with VG1, 
resulting in a tunable two-qubit operation frequency ν↑↓. b, By fitting the data in 
a, we map out ν↑↓ and T2*,CZ as a function of VCZ. The orange and blue colouring 
40 MHz (the difference is exaggerated in the schematic for clarity) for 
individual qubit control. c, ESR spectrum of the |↓,↓⟩-|↑,↓⟩ transition as a 
function of increasing detuning. The data have been offset by a frequency 
ν0 = 39.14 GHz and the spin-up fractions are normalized for clarity. The 
dashed lines are fits using equation (1) and assuming t0 = 900 MHz, a Stark 
shift of 19 MHz V⁻¹, and that the top gates have a lever arm of 0.2 eV V⁻¹. 
d, As for c, but with an additional pulse of amplitude VP (see schematic) so that 
we initialize antiparallel spin states (AP) and observe both the |↑,↓⟩-|↓,↓⟩ 
and |↓,↑⟩-|↑,↑⟩ transitions. The labels I-IV indicate electron tunnelling 
between R and D1,2; see text for details. 


Figure 3a shows the spin-up fraction f↑ of Q2 after applying a (π/2)X-pulse 
and a (π/2)Y-pulse on Q2 separated by an interaction time τZ with 
increasing exchange coupling, set via ε and tuned by the voltage VCZ. 
Plotting the frequency ν↑↓ as a function of VCZ (Fig. 3b) gives a trend 
consistent with that observed via ESR mapping, as shown in Fig. 2c, d. 
The two-qubit dephasing time T2*,CZ is the free induction decay time of 
and the arrows indicate the axis each data set corresponds to; the colouring 
of the data corresponds to that in c, indicating VCZ. The inset shows the number 
of possible CZ rotations NCZ. Although T2*,CZ decreases with coupling, the 
number of possible two-qubit rotations continues to increase. 
the two-qubit system. At large values of detuning (ε → ∞) or when the 
interaction vanishes (t0 → 0), T2*,CZ reduces to the single-qubit Ramsey 
T2*. We obtain T2*,CZ by fitting an exponential to the decay of the 
oscillations in Fig. 3a. These values of T2*,CZ are plotted along with 
the measured ν↑↓ values in Fig. 3b. We find that the two-qubit dephasing 
rate (T2*,CZ)⁻¹ rises in step with the exchange coupling and ν↑↓, 
which is to be expected because dν↑↓/dVCZ also increases with ν↑↓, 
meaning that the qubit system becomes increasingly sensitive to 
electrical noise. Despite this, the total number of oscillations 
NCZ = ν↑↓ × T2*,CZ also increases with ν↑↓, as shown in Fig. 3b. In 
Supplementary Information section 7, we show an optimized sequence 
where T2*,CZ = 8.3 μs and ν↑↓ = 3.14 MHz, such that NCZ > 26. 
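
The arithmetic behind the quoted number of CZ rotations can be sketched as follows. This is an illustration only: the decay envelope is synthetic (generated from the quoted 8.3 μs, not measured data), the log-linear fit stands in for whatever fitting routine was actually used, and the function names are hypothetical.

```python
import numpy as np

def fit_decay_time(t_us, envelope):
    """Least-squares fit of envelope = A * exp(-t / T); returns T in microseconds."""
    slope, _intercept = np.polyfit(t_us, np.log(envelope), 1)
    return -1.0 / slope

def cz_rotation_count(freq_mhz, t2_star_us):
    """Number of two-qubit oscillations within the dephasing time (MHz x us)."""
    return freq_mhz * t2_star_us

# Synthetic, noiseless envelope with the decay time quoted in the text.
t_us = np.linspace(0.0, 20.0, 50)
envelope = 0.5 * np.exp(-t_us / 8.3)

t2_star_us = fit_decay_time(t_us, envelope)
n_cz = cz_rotation_count(3.14, t2_star_us)
print(f"T2* = {t2_star_us:.2f} us, N_CZ = {n_cz:.2f}")
```

With the quoted values the product is 3.14 MHz × 8.3 μs ≈ 26.06, consistent with NCZ > 26.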

For all of the experiments described in Figs 2 and 3, we performed 
read-out only on Q2, owing to its proximity to the reservoir used for 
spin selective read-out. However, to demonstrate a two-qubit CNOT 
gate it is desirable to read out the state of both qubits, allowing the 
observation of non-classical correlations. This requires a more 
complex pulsing protocol on G1 (Fig. 4a), to first read Q2 at the 
charge transition (0, 0)-(0, 1), subsequently read and initialize Q1 at 
Figure 4 | Two-spin correlations for a two-qubit logic gate. a, Pulsing 
protocol for two-qubit read-out and single- and two-qubit operations. After 
read-out of Q2 (R2) and Q1 (R1), we pulse back to (R2) to ensure proper 
initialization. Individual qubit operations are performed with high ε, whereas 
the CZ operation occurs in the presence of interaction. b, Stability diagram 
showing the operation regime. c, Spin-up fraction of both qubits after 
initializing Q1 spin up (top) and spin down (bottom) using a microwave pulse 
and applying a controlled rotation using Q2 as the target qubit. A CNOT gate is 
achieved in 480 ns, as indicated by the dotted purple line (see inset for the 
the (0, 1)-(1, 1) transition, and initialize Q2 at the (0, 0)-(0, 1) 
transition. The read-out (R1,2) and control (C) bias points are shown on the 
charge stability map in Fig. 4b. Following two-qubit read-out of the 
previous state, the system is prepared as |↓,↓⟩, after which single-qubit 
rotations are applied to each qubit to prepare any desired initial 
two-qubit state. 

Figure 4c shows the measured states of Q1 and Q2 after applying the 
CZ gate as a function of τZ, with Q1 initialized to |↑⟩ (top panel) and |↓⟩ 
(bottom panel). As expected, the control qubit Q1 (red) is not 
perturbed when exchange is turned on because it is in a basis state; 
however, the target qubit Q2 (blue) is initialized to (1/√2)(|↑⟩ + |↓⟩) with a 
(π/2)X-pulse on Q2 and so rotates about the equator of the Bloch 
sphere when exchange is turned on for time τZ. The strength of the 
exchange coupling is set so that Q2 rotates about the Bloch sphere at 
double the frequency for Q1 = |↓⟩ than for Q1 = |↑⟩; this is reflected in 
the final state of Q2 plotted in Fig. 4c (see Supplementary Information 
section 8 for further details). A CZ gate is realized at τZ = 480 ns, when 
φ↑↓ + φ↓↑ = π, and this is converted to a CNOT gate (the target qubit is 




corresponding Bloch sphere animation). d, Two-spin probabilities as functions 
of the microwave pulse length on Q1 after applying a CNOT gate (see inset for 
the corresponding Bloch sphere animation), showing clear anticorrelations 
between the two qubit spin states. The different plots correspond to different 
spin states of Q1,2, as indicated. The black lines correspond to fits based on a 
CNOT gate, and include the experimental read-out errors (see Supplementary 
Information section 9). The green dotted lines correspond to the intended 
maximally entangled states. 
flipped when the control qubit is |↓⟩) by applying (π/2)X-pulses on Q2 
before and after the CZ. 

We use the CNOT gate to create an entangled state of Q1 and Q2. To 
realize this, we initialize the qubits first to the |↓,↓⟩ state, then apply a 
varying microwave pulse time to rotate Q1 into superposition states, 
with a Rabi time TRabi = 2.4 μs, and finally apply the CNOT gate. To 
demonstrate the CNOT gate, we convert the individual qubit spin-up 
fractions into two-spin probabilities; Fig. 4d shows the four possible 
two-spin probabilities. Clear oscillations are observed in the 
probabilities of the antiparallel states, P(|↓,↑⟩) and P(|↑,↓⟩), whereas these 
oscillations are almost absent in the probabilities of the parallel states, 
P(|↓,↓⟩) and P(|↑,↑⟩), thereby demonstrating the anticorrelations 
expected for the CNOT gate. The hints of oscillations in the symmetric 
spin states are probably due to read errors (which are included in the 
fitted line in Fig. 4d, see also Supplementary Information section 9); 
our current visibilities are not sufficient to demonstrate violation of the 
Bell inequality. 
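
The expected anticorrelations can be reproduced with an idealized state-vector calculation. The sketch below is a toy model under stated assumptions: no read-out errors or dephasing, a rotation of Q1 about the x axis by an angle θ (the mapping from pulse time to θ is not modelled), and an ideal CNOT that flips Q2 when Q1 is spin down. The names `rx`, `CNOT_DOWN` and `two_spin_probabilities` are hypothetical.

```python
import numpy as np

# Basis order |Q1, Q2>: down-down, down-up, up-down, up-up.
LABELS = ("dd", "du", "ud", "uu")

# Ideal CNOT, assumed to flip Q2 (target) when Q1 (control) is spin down.
CNOT_DOWN = np.array([[0, 1, 0, 0],
                      [1, 0, 0, 0],
                      [0, 0, 1, 0],
                      [0, 0, 0, 1]], dtype=complex)

def rx(theta):
    """Single-qubit rotation by angle theta about the x axis."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -1j * s], [-1j * s, c]])

def two_spin_probabilities(theta):
    """Two-spin probabilities after rotating Q1 by theta and applying the CNOT."""
    psi = np.zeros(4, dtype=complex)
    psi[0] = 1.0                                 # start in |down, down>
    psi = np.kron(rx(theta), np.eye(2)) @ psi    # rotate Q1 only
    psi = CNOT_DOWN @ psi
    return dict(zip(LABELS, np.abs(psi) ** 2))

for theta in (0.0, np.pi / 2, np.pi):
    print(round(theta, 3), two_spin_probabilities(theta))
```

In this ideal model only the antiparallel probabilities oscillate, as cos²(θ/2) and sin²(θ/2), while the parallel probabilities stay at zero; at θ = π/2 the state is maximally entangled.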

Future experiments will include improvements to the read-out 
fidelities, thus facilitating full two-qubit tomography. The qubit control 
fidelities could be further improved by lowering the sensitivity to 
electrical noise. Although these silicon qubits represent the smallest 
scalable two-qubit system reported so far, the complete fabrication 
process is compatible with standard CMOS (complementary 
metal-oxide-semiconductor) technology, and is also consistent with current 
transistor feature sizes, offering the prospect of realizing a large-scale 
quantum processor using the same silicon manufacturing technologies 
that have enabled the current information age. 
Peptoid nanosheets exhibit a new 
secondary-structure motif 


Ranjan V. Mannige, Thomas K. Haxton, Caroline Proulx, Ellen J. Robertson, Alessia Battigelli, 
Glenn L. Butterfoss, Ronald N. Zuckermann & Stephen Whitelam 


A promising route to the synthesis of protein-mimetic materials that 
are capable of complex functions, such as molecular recognition and 
catalysis, is provided by sequence-defined peptoid polymers, 
structural relatives of biologically occurring polypeptides. 
Peptoids, which are relatively non-toxic and resistant to degradation, 
can fold into defined structures through a combination of 
sequence-dependent interactions. However, the range of possible structures 
that are accessible to peptoids and other biological mimetics is 
unknown, and our ability to design protein-like architectures from 
these polymer classes is limited. Here we use molecular-dynamics 
simulations, together with scattering and microscopy data, to 
determine the atomic-resolution structure of the recently discovered 
peptoid nanosheet, an ordered supramolecular assembly that extends 
macroscopically in only two dimensions. Our simulations show that 
nanosheets are structurally and dynamically heterogeneous, can be 
formed only from peptoids of certain lengths, and are potentially 
porous to water and ions. Moreover, their formation is enabled by 
the peptoids' adoption of a secondary structure that is not seen in 
the natural world. This structure, a zigzag pattern that we call a 
Σ ('sigma')-strand, results from the ability of adjacent backbone 
monomers to adopt opposed rotational states, thereby allowing 
the backbone to remain linear and untwisted. Linear backbones tiled 
in a brick-like way form an extended two-dimensional 
nanostructure, the Σ-sheet. The binary rotational-state motif of the Σ-strand is 
not seen in regular protein structures, which are usually built from 
one type of rotational state. We also show that the concept of 
building regular structures from multiple rotational states can be 
generalized beyond the peptoid nanosheet system. 

The peptoid nanosheet is a recently discovered, free-floating planar 
assembly that is only two molecules thick but that extends laterally for 
micrometres (Supplementary Figs 1, 2). Nanosheets assemble from 
peptoids bearing alternating aromatic and charged sidechains (Fig. 1a) 
via compression at an air-water interface. We find, through a 
combination of atomic-force microscopy (AFM; Supplementary 
Fig. 3) and powder X-ray diffraction (XRD), that nanosheets 
are bilayers, 3.0 ± 0.3 nm (± s.d.) thick, in whose interior the aromatic 
sidechains are sequestered, and on whose surfaces the charged 
sidechains are presented. Optical microscopy shows nanosheets to be 


Figure 1 | Snapshot of a peptoid nanosheet 
obtained from molecular-dynamics simulations. 
a, Left, an amphiphilic 28-residue peptoid, which 
assembles into extended nanosheets only two 
molecules thick, as shown in the 
fluorescent-microscopy image to the right. b, Snapshot of a 
bilayer, obtained from molecular-dynamics 
simulations. c, Backbones (blue and red) are 
generally collinear; aromatic rings (yellow) in the 
interior of the nanosheet show little order. (The 
lower leaf of the bilayer is coloured white.) 
roughly rectangular, many square micrometres in horizontal extent, 
and to have relatively straight edges (Fig. 1a and Supplementary 
Fig. 2b). XRD measurements in the plane of the bilayer show peaks 
at characteristic distances of 4.5 Å and 3.6 Å (Supplementary Fig. 9), 
suggesting a high degree of order on the molecular scale. Direct 
observation of polymer chains in the nanosheet by transmission electron 
microscopy indicates a polymer-polymer parallel spacing of about 
4.5 Å (ref. 8), in accordance with the 4.5-Å XRD peak. 

To develop an atomistic model of peptoid nanosheets consistent 
with these observations, we used atomistic molecular-dynamics 
Figure 2 | Selected nanosheet features and dynamics. a, Three ways of 
viewing the bilayer: outside, inside and edge views. b, The general layout of 
polymers within each leaf is brick-like, with pockets (green triangle) formed at 
peptoid termini. c, d, Each peptoid, such as that shown here, can be coloured 
with conventional atomic colours (c, top), or with respect to dynamism 

(c, bottom; expanded in Supplementary Fig. 12), as measured by root-mean- 
squared fluctuations of atomic positions, showing that backbones possess three 
dynamically distinct regions (d): a rigid domain, a dynamic central linker 
and dynamic termini. As in globular proteins, the backbone is most dynamic in 
simulations in conjunction with our recently developed 
CHARMM-based force field for peptoid backbones, MFTOID (CHARMM, 
Chemistry at HARvard Molecular Mechanics; MFTOID, Molecular 
Foundry (MF) and Peptoid (TOID)). We surveyed a range of low- 
energy nanosheet configurations (Supplementary Fig. 4a) as starting 
points for molecular-dynamics simulations; here, we report the results 
of one such set of simulations (see Methods). 

Figure 1b and c show snapshots of a nanosheet patch, with periodic 
images displayed (for an image of the simulation box only, see 
Supplementary Fig. 4c). The nanosheet possesses linear order in its 
termini and linker regions. e-g, Sections of the nanosheet from various angles 
and in various representations (enlarged in Supplementary Fig. 13). Each 
top and bottom panel (related by a double-headed arrow) show the same 
section and viewpoint, rendered in different ways. e, Water (green) encroaches 
at a pocket, suggesting that water channels might be engineered into nanosheets 
(Supplementary Fig. 20). f, Backbone order. g, Aromatic disorder. Real-space 
measurements taken from simulation averages and inferred from scattering 
experiments (shown at the bottom) agree to within 1 Å. 
polymer backbones, and relative disorder in the aromatic and charged 
sidechains that are, respectively, internal and external to the bilayer 
(Supplementary Figs 11 and 12). Individual peptoid chains maintain 
the brick-like arrangement that is favoured by the segregation of 
charged sidechains into positively and negatively charged blocks 
(Fig. 1a), in which gaps or 'pockets' between polymer termini alternate 
with the backbones' central portions, as one looks along the direction 
perpendicular to that of the polymer chains (Fig. 2b). As shown in 
Fig. 2c and Supplementary Fig. 12, backbone regions away from the 
pockets are the least dynamic portions of each peptoid, displaying a 
root-mean-squared fluctuation (RMSF) of <1 Å. The pocket-forming 
termini of each peptoid are more flexible than these interior regions, 
and are visibly less ordered (Fig. 2c, d and Supplementary Fig. 12), 
while the polymer sidechains are more flexible still. This hierarchy of 
flexibility is similar to that seen in proteins. 
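
For reference, the per-atom root-mean-squared fluctuation used above can be computed from a trajectory as in the following minimal sketch. It assumes the frames have already been aligned to a common reference and are stored as an array of shape (frames, atoms, 3) in ångströms; the function name `rmsf` and the two-atom toy trajectory are hypothetical and are not taken from our simulations.

```python
import numpy as np

def rmsf(trajectory):
    """Per-atom RMSF for an aligned trajectory of shape (n_frames, n_atoms, 3)."""
    mean_positions = trajectory.mean(axis=0)            # time-averaged structure
    displacements = trajectory - mean_positions         # deviation in each frame
    squared = (displacements ** 2).sum(axis=2)          # squared distance per atom
    return np.sqrt(squared.mean(axis=0))                # average over frames

# Toy trajectory: atom 0 is rigid, atom 1 oscillates by +/- 1 angstrom along x.
toy = np.array([[[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]],
                [[0.0, 0.0, 0.0], [-1.0, 0.0, 0.0]]])
fluctuations = rmsf(toy)
print(fluctuations)
```

Atoms with small values correspond to rigid regions such as the backbone away from the pockets, and larger values to flexible regions such as termini and sidechains.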

Figure 2e-g and Supplementary Fig. 8 show that the nanosheet 
thickness, interpolymer spacings and intersidechain spacings seen in 
our simulations lie within about an ångström of the mean 
characteristic distances seen in AFM and X-ray-scattering experiments. 
The range of nanosheet thicknesses measured by AFM, 2.7 nm to 
3.3 nm (Supplementary Fig. 3), is also reproduced in our simulations 
(Supplementary Fig. 8). These comparisons allow us to assign physical 
features to experimental measurements, and to verify the accuracy of 
our simulations (see Supplementary Table 1). 

This brick-like arrangement of backbones that allows polymers to 
alternate with pockets has important consequences for the stability of 
nanosheets as a function of peptoid length. As polymer length decreases, 
it should become less energetically favourable for each residue to form a 
nanosheet, because pockets (near which peptoids possess fewer 
favourable electrostatic and aromatic interactions) increase in number per unit 
area as polymer length decreases. Simulations indeed show that short 
polymers are not stable in nanosheet form. In Fig. 3, we report the results 
of simulations done using nanosheets built from polymers that are 4, 8, 
12, 16, 20, 24 or 28 residues long. Polymers longer than 12 residues form 
stable nanosheets. Nanosheets built from 12-residue polymers show signs 
of instability and a decrease in order (Fig. 3a-c) upon simulation, and 
nanosheets built from polymers shorter than 12 residues display an 
almost complete loss of structure within 40 ns of the start of the 
simulation (Fig. 3d and Supplementary Fig. 15). In accordance with our 
simulations, our experiments show that peptoids that are 12 or fewer 
residues in length do not form stable nanosheets (Fig. 3e and 
Supplementary Figs 18 and 19). Our simulations suggest that this failure 
occurs because short-polymer nanosheets are unstable relative to 
disordered aggregates. Our simulations also suggest that if polymer 
connections are reinforced, for example by head-to-tail cross-linking, this 
limitation could be overcome. 

Figure 3 | Peptoids shorter than 12 residues do not form stable nanosheets. 
a-c, Metrics of stability and order for nanosheets built from peptoids of various 
lengths, L. These metrics are: a, mean polymer alignment with the x axis 
(equation (2) in Methods; see Supplementary Fig. 16a); b, root-mean-squared 
deviation from the initial configuration (heavy atoms only; see Supplementary 
Fig. 16b); and c, trough-to-peak ratio associated with backbone-backbone 
order obtained from simulated X-ray scattering (a spectrum for L = 28 is 
shown in the inset; see Supplementary Fig. 17). Larger values of all metrics 
indicate a lesser degree of order. Short-polymer nanosheets continue to evolve 
throughout the times simulated (a, b). I, intensity; Ip, peak intensity; It, trough 
intensity; Iig, intensity expected of an ideal gas (for the purposes of 
normalization); xy X¥ component of the wavevector. d, e, Simulations (d) and 
experiments (fluorescence optical microscopy; e) done with peptoids of lengths 
L = 4, 8, 12, 16 or 28 show a similar trend: peptoids shorter than 12 residues 
do not form stable nanosheets. The case L = 12 is marginal: in experiments, 
high concentrations of peptoids result in the formation of small nanosheets, 
which become disordered on a timescale of about one week (see Supplementary 
Fig. 18). Colouring backbones with respect to the backbone dihedral angle, φ, 
makes clear that adjacent backbone monomers adopt opposed rotational 
states. This novel secondary structure, the Σ-strand, is discussed in Fig. 4. 

Figure 4 | Peptoid nanosheets are made possible by a novel secondary 
structure. a-d, Typical secondary structures found in proteins, such as the 
β-sheet (a) and α-helix (c), are described by specific pairs of backbone dihedral 
angles, φ and ψ, whose position is labelled by i in their respective 
Ramachandran plots (b, d; see Methods). The heat map is a two-dimensional 
histogram showing the normalized probability p(ψ, φ) that a residue in one of 
the indicated structures will adopt particular combinations of the dihedral 
angles φ and ψ. e, f, In contrast, the secondary structure observed in the peptoid 
nanosheet (e) consists of two characteristic (φ, ψ) positions, i and j (f). (The 
other occupied regions in f, that is, the pale blue locations, indicate polymer 
ends and linker regions; see Supplementary Figs 21 and 22. Owing to the 

When considering the design of nanosheets, the observation that 
interchain pockets are tolerated by polymers longer than a certain limit 
shows that pockets should be considered to be an integral part of the 
nanosheet structure. Simulations show that such pockets allow water to 
encroach on the aromatic centre of the bilayer (Fig. 2e and 
Supplementary Fig. 20b). This observation suggests that peptoid termini could be 
modified to create pockets able to bind specific small molecules, 
potentially permitting catalytic function. Furthermore, our simulations 
show that, if pockets on opposing leaves coincide, then nanosheets 
possess channels through which water can pass (Supplementary 
Fig. 20c), indicating the potential of nanosheets as selective membranes. 
Experimental work to test these predictions is under way. 

The stability of extended, planar peptoid nanosheets is enabled by 
the polymers' linear, untwisted configurations. Our simulations reveal 
that this linearity results from the ability of sequential backbone 
residues to adopt one of two states, whose rotations about the backbone axis 
oppose and cancel each other. In Fig. 3d we have coloured these two 
achirality of the peptoid backbone's α-carbon, i′ and j′ are equivalent to i and j, 
respectively.) Positions i and j are visibly equidistant from the achiral diagonal 
of the Ramachandran plot, and so adopting the two states in an alternating 
fashion allows the peptoid backbone to remain linear and untwisted. This 
motif, the Σ-strand, allows the formation of extended planar structures that 
have not been made previously using protein-like building motifs. g, 
Quantum-mechanical (QM) calculations of a range of possible Σ-type strands show 
energy minima (h) that match the high-occupancy regions of f, indicating that 
the rotational tendencies of isolated polymers are preserved by the sidechain 
interactions established within the nanosheet. (MD, molecular dynamics. The 
alternating pattern is defined by the numbers A and B.) 


rotational states red and blue. This building principle is distinct from 
that used by proteins: the backbones of protein secondary structures 
such as α-helices and β-sheets are defined primarily by a single 
rotational state. 

Protein rotational states are quantified by their backbone dihedral 
angles, traditionally denoted φ and ψ, and conventionally described by 
a Ramachandran plot. As shown in Fig. 4a-d, regular chiral protein 
structures such as the α-helix and β-sheet correspond roughly to a 
single location, i, on the Ramachandran plot. In contrast, stable 
nanosheets are composed of peptoids whose backbone states occupy 
two specific regions of the Ramachandran plot (Fig. 4f), labelled i and j. 
Chains whose adjacent residues alternate between these two states 
remain linear. A snapshot of three backbone segments (Fig. 4e) 
emphasizes this alternating motif. We call this motif the Σ-strand, because its 
linear, twist-free nature derives from the combination, or sum ('Σ'), of 
its two rotational states (and because the resulting polymer 'snakes' 
back and forth). In principle one could have a Σ-strand built from 
any two opposed rotational states. However, density functional theory 
calculations show that the particular rotational states observed in our 
atomistic simulations are the lowest-energy Σ-type arrangement for 
isolated polymers (Fig. 4g and Supplementary Figs 23-26). This 
comparison provides additional confidence in the accuracy of the MFTOID 
force-field simulations, and confirms that the basic rotational tendency 
of an isolated peptoid backbone (Fig. 4h) is preserved by the 
sidechain-sidechain interactions established within the nanosheet (Fig. 4f). 
Figure 5 | The design principle underlying the peptoid nanosheet has 
general application. a, b, Canonical secondary structures are 'single-point' 
ones, meaning that they are described by single regions on a Ramachandran 
plot. 'D' and '7B' are labels given to certain common peptoid structures. c, Here 
we propose the concept of building higher-order or multi-point secondary 
structures using combinations of points (alternating blue and red dots) on the 
Ramachandran plot. d, This principle can be used to create a large set of novel 
structures, of which the Σ-strand is but one example, potentially expanding the 
range of functional polymer structures. For instance, model 8 shows that a 
linear chain can display alternating orthogonal sidechains, which may allow the 


The Σ-strand motif is protein-like in its regularity, but is different 
from protein secondary structures in important ways: it is a molecular 
motif that permits planar assemblies of macroscopic extent; it is 
stabilized for reasons other than hydrogen bonding; and it is built from 
two (not one) rotational states. The closest comparable secondary 
structures in proteins, β-sheets, are unable to maintain macroscopic 
flatness owing to a propeller-like twist in the shape of each strand 
(although fibril-like protein assemblies can be of considerable size 
and can be combined to form structures of macroscopic extent). The 
Σ-sheet therefore represents a complement to the basic building blocks 
of the protein world. To replicate the Σ-sheet assembly using other 
polymers, one therefore needs two things: molecular rotational 
properties that give rise to symmetry about the diagonal of the 
Ramachandran plot (Fig. 4f), and sidechain sequence patterning that 
promotes these conformations within a desired assembly. 

The principle underlying the stability of the Σ-strand and the peptoid 
nanosheet is that regular secondary structures can be built from more 
than one rotational state, that is, from more than one point on the 
Ramachandran plot. This principle immediately suggests the possibility 
of a large set of novel secondary structures, some of which are shown in 
Fig. 5. The potential applications of this set of structures go beyond 
those of the nanosheet system (the Σ-strand is but one member of this 
set) and may include the formation of novel folded structures and 
formation of a planar assembly distinct from the Σ-sheet ('possible Σ-strand'). 
Model 5, if cyclized, may allow for the assembly of stacked disks (alternating 
rotational properties have been observed in cyclic peptoids, while alternating 
LD cyclic peptides have been found to stack). As in the case of the Σ-strand 
(model 7), such structures can in principle be stabilized by physically 
reasonable sidechain interactions possessing appropriate geometries. 
Supplementary Table 2 discusses the method by which the structures in d were 
built, relaxed and ranked. For each key in d, the superscript 'c' indicates a 
cis-amide-bond dihedral angle. 


assemblies. Since the 1950s, the literature on protein secondary 
structure has focused on the idea of building with one type of rotational 
state, to make 'single-point' Ramachandran structures. We propose 
that building higher-order, multi-point secondary structures might 
greatly expand the repertoire of folded polymeric building blocks. 
Figure 5d provides guidelines for backbone design. It may also help 
to characterize structurally the rapidly growing family of solid-state 
peptoid polymer crystals. Such crystals consist largely of extended 
polymer conformations that so far have eluded characterization at 
atomic resolution; the extended backbones seen in Fig. 5d are candidate 
conformations. More generally, the new building principle that we have 
identified, combined with the ability to encode into peptoids a defined 
sequence of chemically diverse monomers, offers a way to create new 
structured polymers through combinatorial design. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Experimental nanosheet synthesis. Block-charge peptoids of lengths L = 4, 8, 12, 
16 or 28 residues were synthesized and purified using automated solid-phase 
synthesis. For L = 12 residues (Supplementary Fig. 18d-f) and L > 12 residues 
(Fig. 3e), peptoids were present at a concentration of 20 μM in 10 mM Tris buffer, 
pH 8.0. Peptoid nanosheets of length L = 12 were absent at this concentration 
(Supplementary Fig. 18d), perhaps because not enough polymer adsorbed to the 
air-water interface (see discussions following Supplementary Fig. 18d); therefore 
we attempted to produce L = 12 nanosheets at higher concentrations (Fig. 3e and 
Supplementary Fig. 18a-c). Peptoids of length L = 12 were present at a 
concentration of 1 mM (Supplementary Fig. 18c), of length L = 8 at 22 mM 
(Supplementary Fig. 18b), and of length L = 4 at 41 mM (Supplementary Fig. 18a), all in 
10 mM Tris buffer, pH 8.0. Sheets were prepared by agitation via the vial-rocking 
method. To test whether the peptoids were adsorbing to the air-water interface 
(a prerequisite to assembly) we obtained surface tension data for the L = 4, 8 and 
12 peptoids via the pendant drop method (see discussions following 
Supplementary Fig. 18). 

Nanosheet yield analysis by optical microscopy. Identical volumes were 
removed from each sample and applied to a thin agarose-gel slice to allow imaging 
for Fig. 3e and Supplementary Fig. 18 (ref. 25). 

Measuring distances by single-angle X-ray scattering and AFM. Nanosheet 
thicknesses were calculated via AFM. We obtained AFM images of dry 
nanosheets deposited on a mica substrate in ambient air. Other distances, such 
as intrapolymer spacings (Fig. 2 and Supplementary Fig. 8), were obtained via 
XRD. XRD data also show a peak corresponding to the thickness of the 
nanosheet. 

Molecular-dynamics protocol. We surveyed a range of low-energy nanosheet 
configurations (Supplementary Fig. 4a) as starting points for molecular-dynamics 
simulations. To conduct this survey we built 28-residue polymers into monolayers 
ina brick-like arrangement, and joined two identical opposing monolayers to form 
a bilayer (Supplementary Fig. 4a). The brick arrangement is suggested by simple 
electrostatic considerations, because peptoids’ charged sidechains are segregated 
into positive and negative blocks along each chain"' (Fig. 1a). The computational 
search space so defined possesses six degrees of freedom, described by distances 
between polymers within the same layer and between monolayers (Supplementary 
Fig. 4a). We calculated energies (in implicit water) for a large number of these 
parameter sets, and chose six low-energy versions on which to focus 
(Supplementary Fig. 4b). Each nanosheet version was then explicitly solvated 
and relaxed in a sequence of protocols (see Supplementary Fig. 4d), before under- 
going constant-pressure molecular-dynamics simulation using a leap-frog integ- 
rator (a commonly used second-order numerical method for integrating equations 
of motion) with a 1-fs timestep. 
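
The leap-frog scheme named above can be illustrated with a minimal sketch. This is not the CHARMM implementation: the function name, the one-dimensional harmonic test force and the reduced units are assumptions chosen only to show how velocities at half steps and positions at whole steps are advanced alternately.

```python
import numpy as np

def leapfrog(x, v_half, force, mass, dt, n_steps):
    """Advance positions x and half-step velocities v_half by n_steps.

    v_half holds v(t - dt/2) on entry and v(t_end - dt/2) on exit.
    """
    for _ in range(n_steps):
        v_half = v_half + dt * force(x) / mass  # v(t + dt/2) from v(t - dt/2)
        x = x + dt * v_half                     # x(t + dt) from x(t)
    return x, v_half

# Hypothetical test system: a unit-mass harmonic oscillator, F = -k x.
k, mass, dt = 1.0, 1.0, 0.01
force = lambda x: -k * x
x0 = np.array([1.0])
v0 = np.array([0.0])

# Shift the initial velocity back by half a step to start the staggered scheme.
v_start = v0 - 0.5 * dt * force(x0) / mass
x_end, v_end = leapfrog(x0, v_start, force, mass, dt, 1000)
```

After 1,000 steps (t = 10 in these reduced units) the position should lie close to the analytical value cos(10), with an error that shrinks as the square of the timestep.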

Each of the six low-energy versions of the nanosheet was seen to be stable for 
over 50 ns of molecular-dynamics simulation at standard temperature and pres- 
sure; that is, each had ceased to evolve (structural snapshots for each simulation are 
available in Supplementary Information), and did not dissolve or convert to a 
different structure (Supplementary Fig. 5). (In contrast, high-energy initial con- 
figurations did not yield stable structures.) These simulations reveal that, in the 
region of the energy minimum dictated by charged sidechains, there exists a
rugged free-energy landscape with a range of near-degenerate nanosheet
structures. Biomolecules, by contrast, often possess clear free-energy minima.

Our simulations represent periodically replicated patches of nanosheets of 
approximate dimensions 60 nm × 18 nm, and so do not address the nature of
nanosheet order on the micrometre scale. Given that nanosheets are produced 
by a far-from-equilibrium mechanical protocol, and given the stability of all six
nanosheet versions in our simulations, it is possible that extended nanosheets 
consist of a patchwork of different types of stable local order. Nonetheless, each 
of the six low-energy nanosheet versions displays molecular features consistent 
with experiments (Supplementary Fig. 8), from which we infer that substantial 
portions of nanosheets display locally the atomic-scale features seen in our simu- 
lations. In the main text, for brevity, we present results from one particular 
nanosheet version, number 5. 

Supplementary Figure 4 describes our generation of the six molecular-dynamics 
starting configurations. Molecular-dynamics simulations were done using the 
CHARMM software package. Aside from the initial set-up, all nanosheet series
were simulated using a leap-frog integration algorithm with a 1-fs timestep. 
Simulations were performed in the isothermal-isobaric (NPT, for constant particle 
number, pressure and temperature) ensemble at 300 K and a pressure of 1 atm,
in an orthorhombic periodic box in which the three orthogonal box dimensions
were allowed to vary independently. The Hoover algorithm was used to
maintain constant pressure. Hydrogen bonds were constrained using the
SHAKE protocol. Particle-mesh Ewald summation was used to evaluate long-range
electrostatic interactions, with a real-space cut-off of 12 Å, a sixth-order
cubic spline, and a κ value (width of the Gaussian distribution) of 0.34. Van der
Waals interactions were calculated up to a distance of 12 Å, with a smoothing
function applied from 10 Å to 12 Å.

Simulations reported in the main text were performed in a solution consisting of
water, the nanosheet’s counterions, and potassium chloride at a concentration of
10 mM. We also carried out a series of independent simulations of the version 5
nanosheet, in which potassium chloride concentrations were set to 0.1 mM, 1 mM,
50 mM and 100 mM (Supplementary Fig. 14). These simulations verified that
simulated nanosheet structures do not change markedly over the range of salt
concentrations used in our experiments.

Computational X-ray scattering. We calculated scattering spectra at wavevector
q using the expression

$$I(\mathbf{q}) = \left| \sum_j f_j \exp(i\,\mathbf{q}\cdot\mathbf{r}_j) \right|^2 \qquad (1)$$

where the sum runs over all atoms, $f_j$ is the atomic scattering factor for atom j and
$\mathbf{r}_j$ is the position of atom j. Equation (1) assumes that electrons are localized at
atomic sites. We let $f_j$ equal the atomic number, an approximation that is nearly exact at
the experimental X-ray energies of 11 keV (ref. 33). The values of q were
discretized so as to be commensurate with the periodic box.
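
As an illustration of equation (1), the following sketch evaluates the intensity for a toy two-atom system at wavevectors commensurate with a periodic box. The function names, the box size and the atom positions are hypothetical; the form q = 2π(n_x/L_x, n_y/L_y, n_z/L_z) is the usual way of making q commensurate with a box and is assumed here rather than taken from the text.

```python
import numpy as np

def scattering_intensity(q, positions, f):
    """I(q) = |sum_j f_j exp(i q . r_j)|^2 for a single wavevector q."""
    phases = positions @ q                       # q . r_j for every atom j
    amplitude = np.sum(f * np.exp(1j * phases))  # complex scattering amplitude
    return np.abs(amplitude) ** 2

def commensurate_q(n, box):
    """Wavevector 2*pi*(n_x/L_x, n_y/L_y, n_z/L_z) for integer indices n."""
    return 2.0 * np.pi * np.asarray(n, dtype=float) / np.asarray(box, dtype=float)

# Toy system (assumed values): two carbon-like atoms half a box apart along x.
box = np.array([10.0, 10.0, 10.0])
positions = np.array([[0.0, 0.0, 0.0],
                      [5.0, 0.0, 0.0]])
f = np.array([6.0, 6.0])  # f_j set to the atomic number, as in the text

i_even = scattering_intensity(commensurate_q((2, 0, 0), box), positions, f)
i_odd = scattering_intensity(commensurate_q((1, 0, 0), box), positions, f)
```

For this arrangement the two atoms scatter in phase at n_x = 2 and exactly out of phase at n_x = 1, so the first intensity is (6 + 6)^2 and the second vanishes to within rounding.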

Experimental in-plane X-ray-scattering spectra were taken by allowing
nanosheets to dry on a Kapton grid, stacking the sheets on top of each other to
produce a sample with a uniform orientation. The absence of lamellar peaks when
the X-ray beam was fired ‘face-on’ into the stack confirmed that the nanosheet
normals were uniformly oriented parallel to the grid normal. However, the radially
averaged signal indicated that various sheets and/or domains existed within the
beamline, as expected given that the beam cross-section (120 µm × 800 µm) is
greater than the typical nanosheet size (20 µm × 20 µm). To compare with
these in-plane, radially averaged spectra, we radially averaged the simulated
scattering spectra (equation (1)) in the xy plane of the nanosheet,
$I(q_{xy}) = \langle I(\mathbf{q}) \rangle_{q_z = 0,\ \sqrt{q_x^2 + q_y^2} = q_{xy}}$.
Finally, these intensities $I(q_{xy})$ were normalized
by the expected intensity of an equivalent ideal gas, $I_{\mathrm{ig}}(q_{xy})$, which is discussed in
Supplementary Fig. 17.
Direct measurements. While X-ray scattering is useful for comparing simulation 
to experiment, direct measurements of some features (for example, polymer 
y-padding, nearest neighbour N-N spacing, bilayer thickness) provide additional 
information with regard to distances and distributions. Supplementary Fig. 7 and
its associated text discuss those measurements and how they are calculated,
while Supplementary Fig. 8 compares experimental observations to those metrics. 
Measurement of order in the bilayer. For each peptoid (Fig. 3a), we designate the
position of the backbone nitrogen of the ith residue to be r_i. A simple measure of
order is the angle θ_backbone,x between each peptoid backbone and the x axis

α-Helical and β-sheet segments from proteins. A structure database was obtained
from the Structural Classification of Proteins (SCOPe; release 2.03) that contains
proteins with no more than 40% sequence identity with each other
(http://scop.berkeley.edu/downloads/pdbstyle/pdbstyle-sel-gs-bib-40-2.03.tgz).
α-Helical and β-sheet segments were identified using the DSSP (Define Secondary
Structure of Proteins) algorithm. (φ, ψ) pairs were obtained for all residues within
α-helical and β-sheet segments, which contributed towards the histograms in Fig. 4b, d,
respectively.

Quantum mechanics calculations. Density functional theory was used to assess
the low-energy structures available to a peptoid backbone consisting of two
alternating sets of dihedral angles (φ, ψ). B3LYP and M05-2X were the two functionals
used to calculate the energies of a configuration. B3LYP is a commonly used and
widely applicable functional, and M05-2X is a newer functional that accounts for
dispersion forces. Gaussian 09 (C.01) was used for density functional theory
calculations. For each peptoid model, the alternating pattern is defined by the
numbers A and B (Fig. 4h), where adjacent (φ, ψ) pairs take the values (−A, B) and
(−B, A). A and B were constrained to range from 50° to 180° in steps of 10°.
Molecules were optimized at the HF/6-31G* level of theory, with A and B
constrained and all other degrees of freedom unconstrained. The lowest energy
associated with each (A, B) pair is reported in the Ramachandran plots of Fig. 4h
and Supplementary Figs 24-26.
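
The scan over constrained dihedral angles can be enumerated in a few lines. This is only a bookkeeping sketch of the (A, B) grid described above; the function name is hypothetical and no quantum-chemistry call is made.

```python
import itertools

# A and B each run from 50 to 180 degrees in 10-degree steps, as in the text.
angles = range(50, 181, 10)
grid = list(itertools.product(angles, angles))

def backbone_dihedrals(a, b):
    """Return the two adjacent (phi, psi) pairs defined by A and B."""
    return [(-a, b), (-b, a)]

n_values = len(angles)                 # number of values per angle
n_pairs = len(grid)                    # number of (A, B) combinations scanned
example = backbone_dihedrals(50, 180)  # pattern for A = 50, B = 180
```

Each entry of `grid` would correspond to one constrained geometry optimization, with the lowest energy retained per (A, B) pair.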
Atmospheric warming is projected to increase global mean surface
temperatures by 0.3 to 4.8 degrees Celsius above pre-industrial
values by the end of this century. If anthropogenic emissions continue
unchecked, the warming increase may reach 8-10 degrees
Celsius by 2300 (ref. 2). The contribution that large ice sheets will 
make to sea-level rise under such warming scenarios is difficult to 
quantify because the equilibrium-response timescale of ice sheets is 
longer than those of the atmosphere or ocean. Here we use a coupled 
ice-sheet/ice-shelf model to show that if atmospheric warming 
exceeds 1.5 to 2 degrees Celsius above present, collapse of the major 
Antarctic ice shelves triggers a centennial- to millennial-scale 
response of the Antarctic ice sheet in which enhanced viscous flow 
produces a long-term commitment (an unstoppable contribution) 
to sea-level rise. Our simulations represent the response of the 
present-day Antarctic ice-sheet system to the oceanic and climatic 
changes of four representative concentration pathways (RCPs) from 
the Fifth Assessment Report of the Intergovernmental Panel on 
Climate Change. We find that substantial Antarctic ice loss can
be prevented only by limiting greenhouse gas emissions to RCP 
2.6 levels. Higher-emissions scenarios lead to ice loss from 
Antarctica that will raise sea level by 0.6-3 metres by the year 2300.
Our results imply that greenhouse gas emissions in the next few 
decades will strongly influence the long-term contribution of the 
Antarctic ice sheet to global sea level. 

Ice sheets lose mass and contribute to global mean sea level (GMSL) 
by surface and basal melting, and by dynamic thinning. Melting
typically occurs at the ice surface if air temperatures are above zero, 
and at the ice base either where grounded ice is at the pressure melting 
point, or where ice is afloat in the ocean. Owing to the exchange of heat
between ice and adjacent water or air, changes in environmental 
temperature may cause changes in melt rate. Dynamic ice loss involves 
large-scale adjustment of part of an ice sheet to a change in the force 
balance that determines the ice flow speed (for example, loss of a buttressing
ice shelf). There may be a delay in the dynamic ice-sheet response
because although perturbations may be transmitted rapidly
through ice shelves, the grounded ice sheet takes time to relax to a 
new steady state. Understanding which environmental changes lead to 
an immediate, and which to a lagged, ice-sheet response is important in 
predicting the timescale and transient evolution of the perturbed ice- 
sheet contribution to GMSL, as well as the total contribution that anthro- 
pogenic greenhouse-gas emissions will ultimately commit the planet to. 

In Antarctica, peripheral ice shelves restrain the flow of grounded
ice and are sensitive to warming air and ocean temperatures.
Observations show that 67% to 98% of ocean warming since 2006
occurred in the Southern Ocean, with a similar multidecadal trend
in the heat content of circum-Antarctic waters. Air temperatures
across Antarctica, especially in West Antarctica, have risen on average
0.6 °C in the past 50 years. Precipitation has also increased, but
there remains a net acceleration of mass loss from Antarctica that
continues to contribute to rising sea levels. Recent studies indicate
that parts of West Antarctica may already be undergoing irreversible
retreat, and several studies have focused on quantifying the
resultant Antarctic contribution to GMSL by 2100. Predicting the
multi-centennial-scale to millennial-scale commitments implied by
future climate scenarios and palaeoclimate equilibrium ice-sheet
reconstructions for high-carbon-dioxide (CO2) conditions has
received less attention, however.

Some studies have investigated long-term commitments to sea-level
rise under global warming scenarios using statistical relationships
between past temperatures and global sea levels. Here we present
numerical simulations of the Antarctic ice-sheet/ice-shelf system
response to environmental changes predicted by four RCPs of the Fifth
Assessment Report of the Intergovernmental Panel on Climate
Change (Extended Data Fig. 1; Extended Data Table 1), which we
extend to 5000 CE to capture the multi-millennial response (see
‘Methods’ and Extended Data Fig. 2). We use the Parallel Ice-Sheet
Model, an open-source, three-dimensional, thermodynamic, coupled
ice-sheet/ice-shelf model, and run simulations at spatial resolutions
of 10 km and 20 km. Our method employs two different grounding-line
parameterizations to quantify the likely range of ice-sheet
responses. One implementation uses sub-grid interpolation of basal
melting at grounding lines whereas the other does not (see
Methods). Because the former tends to accelerate grounding-line
retreat in coarse-resolution models such as ours, we refer to sea-level
contributions from these simulations as ‘high’ and those that do not
include the sub-grid basal melt interpolation as ‘low’. Other schemes
may produce even higher or lower values, however.

The most immediate response to predicted climate changes in all but
our RCP 2.6 experiment is a reduction in extent of the major ice shelves
(Ross, Filchner-Ronne and Amery) within 100-300 years (Fig. 1).
Reduced ice-shelf buttressing leads to increased discharge and
flow acceleration that promotes grounding-line recession in areas
where marine-based ice sheets occupy deep basins (such as West
Antarctica; Fig. 1e, h, k). In our ‘high’ simulations, prolonged warming
also leads to substantial loss of ice in East Antarctica, particularly in the
Wilkes subglacial basin, at the margin of the Aurora subglacial basin,
and in the southern and eastern Weddell Sea embayment (pale blue
shading and blue lines in Fig. 1f, i and l). Under RCP 8.5 the range of
sea-level contributions predicted by our ‘low’ and ‘high’ simulations
respectively is 0.1-0.39 m by 2100, increasing to 1.6-2.96 m by 2300
and 5.2-9.31 m by 5000 CE (Figs 1j-l and 2a). Rates of sea-level rise are
5.5-15 mm per year by 2300 under RCP 8.5 conditions, but even under
the lesser forcings of RCP 4.5 and RCP 6 reach 3-5 mm per year by
2300 (Figs 1e, h and 2a). These simulations show that systemic lags
delay the fastest rates of ice-sheet loss by decades to centuries following
the onset of initial forcing (Fig. 2a).

The initial ice-sheet/ice-shelf responses are important, but are relatively
modest compared to the responses triggered now that will take
place (committed changes) over subsequent millennia (Fig. 1c, f, i, l). In
our experiments we see clear correspondence between the magnitude of
a multi-millennial environmental perturbation, the steady-state area
of fringing ice shelves, and the long-term ice-sheet contribution to
GMSL (Fig. 2b). Specifically, we observe a sharp decline in near-equilibrium
ice-shelf extent when atmospheric and oceanic temperatures
are maintained 1.2 °C and 0.3 °C respectively above present. Almost
all floating ice is lost if equilibrium air and ocean temperatures
increase by more than 2 °C and 0.5 °C above present, respectively.
These ice shelf ‘thresholds’ (defined here as an abrupt reduction in
area to 50% of present) also occur during centennial-scale adjustment
of the ice sheet to new equilibria, but happen at different temperatures
(Fig. 2b). We infer therefore that short-lived (decadal-scale)
environmental perturbations may mimic the longer-term responses if the
former are of a large enough magnitude.

Figure 1 | Modelled ice-sheet evolution under Antarctic-specific RCP-based
warming scenarios. Emissions-forced climate warming to 2100 CE
(a, d, g, j) and 2300 CE (b, e, h, k) results in initial sea-level contributions from
Antarctica that are only a small proportion of the total sea-level commitment
by 5000 CE (c, f, i, l). Magnitudes and rates of sea-level contributions are
shown for each panel. Leading values and those in parentheses relate to ‘low’
and ‘high’ scenarios respectively. Ice extent for ‘low’ simulations is shown in
white; blue lines show grounding-line locations for ‘high’ simulations. Pale
blue shading shows grounded ice lost in ‘high’ simulations but present in the
‘low’ scenario. Note the increasing divergence between ‘high’ and ‘low’ beyond
2300 CE. Grey texturing indicates areas of relatively faster-flowing ice. WAIS,
West Antarctic Ice Sheet; EAIS, East Antarctic Ice Sheet; FRIS, Filchner-Ronne
Ice Shelf.

To isolate the causes of lags in the simulated ice-sheet/ice-shelf
system and identify mechanisms by which the system responds to
atmospheric and oceanic forcings, we ran 32 sensitivity experiments
(using the full grounding-line scheme) that isolated changes in air
temperature, precipitation and sea surface temperature (ΔT_air, ΔP_eff
and ΔSST) and simplified the time-varying RCP forcings (Fig. 3a, b).
Each simplified forcing experiment comprised a perturbation-free
interval from 0-2000 CE and a linear increase in T_air, P_eff or SST from
2000 CE to either 2100 CE or 2300 CE, after which the forcing was
maintained unchanged for the remainder of the run (up to 5000 CE). This
simplification was designed to clearly identify modelled ice-sheet
response to each applied forcing.
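
The shape of such a simplified forcing can be written down directly. The sketch below is an illustration only: the function name and the peak value of 4.0 are hypothetical, and the actual magnitudes used in the experiments are those shown in Fig. 3a, b.

```python
import numpy as np

def simplified_forcing(year, peak, ramp_end, ramp_start=2000.0):
    """Piecewise-linear forcing anomaly.

    Zero up to ramp_start, rising linearly to `peak` at ramp_end,
    then held constant for the remainder of the run.
    """
    year = np.asarray(year, dtype=float)
    fraction = np.clip((year - ramp_start) / (ramp_end - ramp_start), 0.0, 1.0)
    return peak * fraction

# Hypothetical peak anomaly of 4.0 (arbitrary units), ramp ending at 2100 CE.
before = float(simplified_forcing(1500, 4.0, 2100))  # perturbation-free interval
midway = float(simplified_forcing(2050, 4.0, 2100))  # halfway up the ramp
after = float(simplified_forcing(5000, 4.0, 2100))   # held at the peak value
```

Passing `ramp_end=2300` instead gives the slower ramp of the second set of experiments with the same functional form.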

Figure 3 illustrates the response of the modelled ice sheet when
forced with each of the three environmental forcings (ΔT_air, ΔP_eff
and ΔSST) in isolation, and when all three are combined, under the
warmest of the simplified RCP scenarios (RCP 8.5) and using the
full grounding-line scheme. Figure 3c and d show that the timing of
ice-sheet response to a warming atmosphere is similar for both 2100
and 2300 scenarios with rate-of-change maxima at 2244-2316 CE,
several centuries from the start of the perturbation. In contrast, peak
ice-volume responses to precipitation changes manifest only 11-41
years after the end of each forcing period. A warming ocean results
in a 26-year peak-response lag when forced to 2100 CE, but there is no
lag when forced to 2300 CE. In summary, precipitation changes produce
rapid sea-level-equivalent ice-volume changes because of their
direct and immediate addition of mass to the ice sheet. Warming of the
atmosphere leads to an immediate loss of mass through surface melt in
some areas (especially West Antarctica), but the effect is small and
generally a warming atmosphere produces a lagged volumetric response
that requires 200-300 years from the initial forcing to reach a
maximum. Thermal changes in the ocean bring about rapid ice-sheet
responses that, using the full grounding-line scheme, are far greater in
magnitude than those arising from atmospheric forcings alone, and
exhibit little to no lag. Despite these differences, all forcings lead to
committed responses that manifest as rates of change that are higher
than those of the initial conditions for thousands of years after the
forcing period (Extended Data Fig. 5).

Figure 2 | Antarctic Ice Sheet (AIS) contribution to GMSL. a, Predicted
sea-level contribution from the Antarctic ice sheet for ‘high’ and ‘low’ simulations
(coloured lines) under each of the four RCP scenarios (darker shading),
based on coeval climatic and oceanic perturbations. The forced response
(grey shading) represents 20% to 36% of the committed response by 5000 CE.
Lighter shading between coloured lines shows rates of sea-level-equivalent ice
loss for each scenario. b, Long-term sea-level commitment as a function of
atmospheric warming (blue shading with squares). Intermediate response
curves for the ‘low’ simulations are shown as dotted lines. Red shading with
triangles shows the relationship between ice-shelf area and atmospheric
warming for the near-equilibrium response and for intermediate stages (dotted
lines). All curves in b are based on data from the four RCP scenario simulations,
further constrained by two additional experiments whose maximum air
temperature forcings are 1.5 °C and 3.35 °C. Pink shading defines the
temperature range within which an ice shelf extent that is less than 50% of the
present extent is simulated.

Figure 3 | Response of modelled Antarctic ice sheet to simplified
environmental forcings. a, b, The relative magnitudes of air temperature
ΔT_air, effective precipitation ΔP_eff and sea surface temperature ΔSST forcings
for simplified RCP scenarios based on likely Antarctic values at 2100 CE (a) and
2300 CE (b). c, d, Trends in sea-level-equivalent ice mass rate-of-change
that arise using the full grounding-line scheme and when forced with changes
in ΔT_air, ΔP_eff, ΔSST, or the combination of these forcings, based on simplified
RCP 8.5 scenarios for 2100 (c) and 2300 (d). Note the different y-axis scales.
Coloured numbers in c and d identify peak rates of change in the 100-year
Gaussian-smoothed response curves. Data are shown relative to zero at 2000 CE.
Grey shading shows periods of applied forcing. Absolute changes in ice-sheet
volume and area (rather than rates of change) to 5000 CE are shown in
Extended Data Fig. 4.
[Plot axes: rate of mass change (millimetres of sea-level equivalent per year) against years (CE).]

The mechanisms responsible for the differing responses seen in
Fig. 3 can be inferred from time-series of glaciological changes
(Fig. 4). Rising air temperatures increase the total volume of temperate
ice (ice that is close to melting point; brown line in Fig. 4a), which,
because it is softer and more easily deformed, leads to an increase in the
non-sliding, or ‘creep’, component of domain-averaged grounded-ice
velocity (Fig. 4c) (the domain is the model grid). Together with an
accompanying increase in sliding velocity, atmospheric thermal
forcing leads to gradual thinning of parts of the ice sheet and a
consequent reduction in the domain-averaged gravitational driving stress
(Fig. 4b). Although the warming perturbation in each simulation is
applied only from 2000-2300 CE, softening of the ice and non-sliding
ice velocity continue to increase throughout the experiment, perhaps
as a consequence of internal feedbacks such as strain heating, which
would encourage faster ice flow and advection of ‘warmer’ ice deeper
into the ice sheet, as proposed previously. Increased precipitation,
even when applied in the absence of atmospheric warming, leads to an
increase in the volume of temperate ice (grey line in Fig. 4a), probably
because highest precipitation occurs in coastal areas under relatively
warm conditions. Increased accumulation results in a greater
domain-averaged ice thickness, which in turn increases the mean driving stress
(Fig. 4b). Non-sliding ice velocities increase markedly throughout the
simulation (Fig. 4c), probably as a consequence of increased ice
thickness and increased driving stress.

Figure 4 | Glaciological changes taking place under single-parameter
environmental perturbations. a, Domain-wide changes in the volume of
temperate ice. b, Mean driving stress of grounded ice. c, d, Averages of the
non-sliding (c) and sliding (d) components of grounded ice velocity. All curves
are from the simplified RCP 8.5, 2300 CE, ‘high’ simulations (Fig. 3b). Grey
shading denotes period of applied forcing.
[Plot legend: ΔT_air, ΔP_eff, ΔSST, all forcings; horizontal axis: years (CE).]

By reducing the volume of floating ice shelves through basal melt- 
ing, increased ocean temperatures lead to an overall reduction in the 
proportion of temperate ice in the domain (purple line in Fig. 4a). 
Thinning of grounded ice in response to loss of buttressing ice shelves 
means that the domain-averaged driving stress is reduced (Fig. 4b), but 
both sliding and non-sliding components of ice velocity increase 
abruptly during the forcing period (Fig. 4c, d). The acceleration of 
sliding velocities probably reflects the reduction in back-pressure 
exerted by the rapidly diminishing ice shelves, whereas deformation
rates probably increase as a consequence of internal strain heating and 
frictional heating at the bed due to faster flow. Flow acceleration then
declines from about 3000 cE, perhaps because grounding lines occupy 
stable slopes once the major deep basins have been vacated, illustrating 
that the dynamic response to a loss of buttressing is transient and may 
be topographically controlled. 

In summary, oceanic warming produces a much greater and more 
rapid ice-sheet response than atmospheric warming, but the rate of ice 
volume change forced by the warming of the ocean peaks and declines 
more quickly than the rate of change forced by atmospheric warming, 
which is more gradual and sustained for longer (Extended Data Fig. 5). 
Whereas ocean-forced perturbations produce the greatest ice-sheet 
contribution to sea level on centennial timescales (approximately five 
times that produced by increased air temperature alone when the full 
grounding-line scheme is used), contributions due to atmospheric 
warming become increasingly important over multi-millennial periods.
The loss of buttressing ice shelves, the consequent increase in
sliding velocity of grounded ice and thinning at the ice-sheet margin,
combined with the thermal softening effect of warming atmospheric 
temperatures that increases the creep-rate of grounded ice, all govern 
the long-term response of the ice sheet and lag times of this system. 
Ice-sheet response to external forcing is therefore mediated both by 
ice-stream dynamics and the timescale of adjustment of viscous flow 
(‘creep’) of grounded ice, which together produce a commitment to sea 
level rise that persists for multiple millennia. 

Unless anthropogenic greenhouse-gas emissions are reduced to 
half of 1990 levels by 2050, global mean annual surface temperatures 
are likely to exceed 2 °C above pre-industrial values by 2100 (ref. 30). 
This aggressive mitigation scenario corresponds to RCP 2.6, which 
both our ‘low’ and ‘high’ simulations show is the only ocean-climate
regime in which the long-term Antarctic contribution to GMSL does 
not exceed 1m (Fig. 2a). Under all other RCP scenarios the future 
commitment to a rise in sea level from Antarctica is substantial. This 
commitment arises because the collapse of buttressing ice shelves 
leads to a dynamic ice-sheet response that greatly increases grounded 
ice discharge for hundreds to thousands of years into the future, even 
if greenhouse-gas emissions are reduced and temperatures stabilize 
(Supplementary Videos 1 and 2). The results presented here and 
elsewhere suggest that ice-shelf stability is vulnerable to a critical
temperature threshold. In our experiments, we find that prolonged 
ocean warming of 0.5 °C above present, together with atmospheric 
warming of 2 °C, ultimately leads to the loss of 80% to 85% of all 
floating ice in Antarctica. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper.

METHODS


The ice-sheet model. Continental-scale studies such as ours are currently unable
to reproduce the level of detail that catchment-scale studies resolve, but they
nonetheless offer useful insights that arise from their wider geographic coverage.
For our suite of experiments we use the Parallel Ice Sheet Model (PISM) version
0.6.3, an open-source, three-dimensional, thermodynamic, coupled
ice-sheet/ice-shelf model. The model combines equations of the shallow-ice and shallow-shelf
approximations for grounded ice, and uses the shallow-shelf approximation for
floating ice. Superposing the shallow-ice and shallow-shelf velocity solutions
allows basal sliding to be simulated according to the ‘dragging shelf’ approach,
and enables a consistent treatment of stress regime across the grounded-ice to
floating-ice transition. Ice streams develop naturally as a consequence of plastic
failure of saturated basal till, depending on the thermal regime and volume of
water at the ice-sheet bed. The amount of water that saturates basal till varies
spatially according to the enthalpy field, but is limited in our implementation so
that no more than a 2-m thickness of saturated substrate accumulates. This
subglacial hydrology model does not conserve water, in the sense that any additional
water above the imposed maximum thickness is permanently lost. Till pore water
pressure is defined as a maximum of 0.8 times the ice-overburden pressure
(Extended Data Table 2). The effective pressure calculated from meltwater
thickness and ice overburden, together with the spatially varying till friction angle, is
then used to calculate the yield strength of basal substrate, which will vary in time
and space as basal meltwater volumes change. The till friction angle is prescribed
heuristically at the start of the run, and follows a simple elevation dependence such
that deeper basins that are likely to hold deformable sediments are assigned lower
values than higher areas, which are more likely to be rock. In our simulations, we
choose values of 10° for areas below −1,000 m, and 30° for areas above 200 m.
Intermediate values are linearly interpolated between these extrema. For
computational tractability we employed a resolution of 10 km for our ‘realistic’ RCP
simulations, and a resolution of 20 km for our ‘simplified’ RCP experiments, where
the focus lies on investigating the differences between results of different
environmental forcings, rather than simply the magnitudes or rates of ice retreat.
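
The elevation dependence of the till friction angle described above amounts to a clamped linear interpolation. The sketch below reproduces only that stated rule; the function name is hypothetical and this is not PISM code.

```python
import numpy as np

def till_friction_angle(bed_elevation_m):
    """Till friction angle (degrees) as a function of bed elevation (metres).

    10 degrees at or below -1,000 m, 30 degrees at or above 200 m,
    linearly interpolated in between (np.interp clamps outside the range).
    """
    return np.interp(bed_elevation_m, [-1000.0, 200.0], [10.0, 30.0])

deep_basin = float(till_friction_angle(-2000.0))  # deep marine basin
highland = float(till_friction_angle(500.0))      # elevated bedrock
midpoint = float(till_friction_angle(-400.0))     # halfway between the limits
```

Because `np.interp` accepts arrays, the same function can be applied to a whole bed-elevation grid in one call.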

Migration of the grounding line is facilitated through a sub-grid scheme that 
calculates one-sided derivatives for surface slope above and below the grounding 
line, and interpolates basal shear stress in x, y according to the spatial gradient 
between adjacent grounded and floating cells. Additionally, we impose reduced
basal traction in the first cell upstream of the grounding line, in which the basal 
resistance calculated from the computed pore water content is replaced by a basal 
drag value that assumes saturation of the basal substrate. In this way, basal shear 
stress gradients across the grounding line are reduced and a more dynamic margin 
is facilitated, in line with current theory. For our ‘high’ simulations we also use a
sub-grid scheme that interpolates sub-shelf melt rates at the grounding line.

Climate data (air temperature, precipitation) are used as inputs to a positive 
degree-day scheme that calculates surface mass balance. We use degree-day factors 
of 3mmK™! per day and 8mmK™! per day for snow and ice respectively. 
Random white noise is included in the calculation to mimic natural variability. 
An elevation dependence of —0.008 Km’ is applied to air temperatures in areas 
where the surface elevation changes. Basal melting of floating ice is calculated 
thermodynamically using a three-component temperature-salinity scheme”. 
This scheme yields the highest melt rates adjacent to ice-sheet grounding lines 
and lower melt rates, or accretion, closer to the central sectors of modelled ice 
shelves. Calving of floating ice is calculated from horizontal strain rates*”** with an 
additional heuristically based minimum thickness condition applied. PISM incor- 
porates a sophisticated bed deformation model in which changes in ice load 
effect both elastic and viscous responses in the underlying bedrock’’. Time- 
dependent variations in relative sea level therefore arise in our simulations from 
isostatic adjustment according to the flexural rigidity of the crust and the viscosity- 
dependent lateral displacement of the mantle. Self-gravitational sea-level effects 
that arise from changes in ice-sheet mass are not included, however, and conse- 
quently our model may not capture the potentially stabilizing feedback of a local 
lowering of sea level. This effect, which becomes more important over multi-centennial
timescales, is typically smaller than isostatic effects, which are included
in our simulations, so the influence on our short-term predictions should be 
minimal. Because we also do not account for self-gravitational rises in local sea 
level, which may occur in response to loss of ice from distal sectors of the ice sheet, 
we may in fact underestimate margin retreat in some areas. 
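
The positive degree-day calculation described above can be illustrated with a minimal sketch. The degree-day factors and the elevation dependence are the values quoted in the text; the function names, the snow-first melt order, and the omission of the white-noise term and of any refreezing are simplifying assumptions made here for illustration, not a description of the PISM implementation.

```python
# Simplified positive degree-day (PDD) melt sketch.
# Factors and lapse rate come from the text; everything else is an
# illustrative assumption (no white noise, no refreezing, snow melts first).

DDF_SNOW = 3.0       # mm of melt per K per day, snow
DDF_ICE = 8.0        # mm of melt per K per day, ice
LAPSE_RATE = -0.008  # K per metre of surface elevation change


def lapse_corrected(temp_c, elevation_change_m):
    """Adjust an air temperature (deg C) for a change in surface elevation (m)."""
    return temp_c + LAPSE_RATE * elevation_change_m


def pdd_melt(daily_temps_c, snow_depth_mm):
    """Return (snow_melt_mm, ice_melt_mm) for a list of daily mean temperatures.

    Snow is melted first at the snow factor; positive degree-days left over
    once the snow is exhausted melt ice at the higher ice factor.
    """
    pdd = sum(max(t, 0.0) for t in daily_temps_c)  # positive degree-days (K day)
    snow_melt = min(DDF_SNOW * pdd, snow_depth_mm)
    pdd_left = pdd - snow_melt / DDF_SNOW          # degree-days not spent on snow
    ice_melt = DDF_ICE * pdd_left
    return snow_melt, ice_melt


if __name__ == "__main__":
    # Surface lowered by 500 m: the air at the new surface is warmer.
    print(lapse_corrected(-10.0, -500.0))
    print(pdd_melt([-5.0, 2.0, 4.0, -1.0], snow_depth_mm=12.0))
```

Because the ice factor is larger than the snow factor, the same number of positive degree-days removes more mass once the snow cover has gone, which is why ablation areas respond more strongly to warming than accumulation areas.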
Climate and ocean inputs. 
Present-day conditions. To define the initial conditions for our experimental 
ensemble, we employ spatially distributed data sets of annual mean air tempera- 
tures, annual precipitation totals, and circum-Antarctic sea surface temperatures 
from a selection of recent observational and model-based compilations. 

The most recent and widely used map of Antarctic surface mass balance derives
from regional atmospheric climate modelling using RACMO2.1/ANT45. These
data span the period 1979–2010, are high-resolution (27 km), and incorporate
snow-drift physics that greatly improve the fit of simulated mass balance to around
750 in situ measurements (compared to models that ignore snow drifting). By
capturing the erosional as well as the sublimation effects of wind, the model is
able to compute both surface accumulation and the location and extent of ablation
areas. Annual mass balance across the continent is calculated to be 2,418 ± 181 Gt
per year with only minor interannual variability but a pronounced, winter-dominated,
seasonal cycle.

Antarctic surface air temperatures derived from Advanced Very High
Resolution Radiometer (AVHRR) infrared data and updated for the period
1982–2004 (ref. 43) provide high-resolution (6.25 km) continent-wide coverage.
Since temperatures are more easily and reliably mapped than surface accumulation,
these data are commonly used in ice-sheet modelling.

Oceanic temperatures are harder to measure, and for our experiments we use
outputs from an established regional-scale climate model. This model (RegCM3)
is coupled with the GENESIS version 3.0 Global Climate Model and is implemented
at 80-km resolution. The model includes parameterizations of surface,
boundary layer and moist processes that account for the physical exchanges
between the land surface, boundary layer and free atmosphere. It includes detailed
representations of snow and near-surface land ice and has been validated against
modern observed polar climates and ice-sheet mass balances. Oceanic outputs
from this model capture the continental-scale pattern of Antarctic sea surface
temperatures but probably underestimate temperatures in areas where recent
subsurface warming has been most rapid, such as the inner Amundsen Sea. The
underestimation of modern ocean temperatures in this area may be the primary
reason that our simulations do not appear to show rapid grounding-line retreat
into the Pine Island and Thwaites Glacier subglacial basins, as has been predicted
by some models, but we also note that the muted and lagged response we see in
the Amundsen Sea sector is consistent with the findings of another study.

Since our primary interest lies in continental-scale interpretations, and since
our methodology is based on the analysis of deviations from initial conditions (that
is, results are bias-corrected), we argue that if any local inaccuracies in our input 
fields exist they should not substantially affect our conclusions at the continental 
scale; however, we acknowledge this as an area where future refinements could 
be made. 
Future climate trajectories. To perturb our initial conditions along the trajectories
forecast for coming centuries, we use Coupled Model Intercomparison Project
phase 5 (CMIP5) data for the four RCPs of the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change. We extracted Antarctic-specific
(60°–90° S) zonally averaged timeseries of air temperature (ΔT_air), effective
precipitation (ΔP_eff) and sea surface temperature (ΔSST) from the ensemble
mean data set for the period 1890–2100 CE (Extended Data Fig. 1 and Extended
Data Table 1). Because we use the ensemble mean, rather than the full suite
of climate model projections presented in the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change, our simulations do not capture
the full range of possible responses implied by the spread of climate model
predictions. However, our aim is to present representative outputs that allow
first-order differences between the main scenarios to be identified.

Raw data were uniformly adjusted so that perturbations are relative to the
present, to ensure consistency with our initial-condition data sets (above). This
adjustment effectively results in air temperatures at 1890 CE approximately 0.6 °C
below those of the present. For each of the RCPs, we used the extracted
Antarctic-specific air temperature, precipitation and ocean temperature anomalies to update
the environmental boundary conditions of our ice-sheet model incrementally
throughout the simulations. To extend the CMIP5 RCP values beyond 2100 CE
we used long-term trajectories based on the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change. In these Extended Concentration
Pathway scenarios, all emissions trajectories result in maximum atmospheric
temperature perturbations by 2300 CE, and remain unchanged for the
remainder of the simulation period (to 5000 CE) (Extended Data Fig. 2). In
all of our experiments, the magnitudes of precipitation and ocean temperature
changes are scaled to air temperature changes according to their respective
variance in the zonally averaged CMIP5 data. These indicate that a 1 °C increase in
air temperature accounts for a 5.3% increase in precipitation, in agreement with
long-term records, whereas ocean temperature changes are approximately
one-quarter of that seen in the atmosphere (Extended Data Fig. 1).
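
The scaling of precipitation and ocean temperature to air temperature described above can be written as a short sketch. The two coefficients are the ones quoted in the text; the function name and the assumption that the precipitation response is linear in the air temperature anomaly (rather than compounding) are ours.

```python
# Scaling of precipitation and ocean temperature anomalies to an air
# temperature anomaly. Coefficients are from the text; linear scaling of
# precipitation is an assumption made for this illustration.

PRECIP_PERCENT_PER_K = 5.3  # % precipitation increase per 1 deg C of air warming
OCEAN_TO_AIR_RATIO = 0.25   # ocean warming as a fraction of air warming


def scaled_forcings(delta_t_air):
    """Return (precip_change_percent, delta_sst) for an air anomaly in deg C."""
    precip_change_percent = PRECIP_PERCENT_PER_K * delta_t_air
    delta_sst = OCEAN_TO_AIR_RATIO * delta_t_air
    return precip_change_percent, delta_sst


if __name__ == "__main__":
    for d_t in (1.0, 2.0, 4.0):
        print(d_t, scaled_forcings(d_t))
```

With these coefficients a 2 °C air temperature anomaly corresponds to roughly 10.6% more precipitation and 0.5 °C of ocean warming.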

In chapter 12 of the Fifth Assessment Report of the Intergovernmental Panel on
Climate Change, long-term (beyond 2300 CE) climate change projections are
considered in terms of two scenarios: first, a situation in which atmospheric
CO2 remains constant at 2300 CE levels, and second, where atmospheric CO2
declines steadily from 2300 CE to 3000 CE. This second scenario, however, results
in only a modest reduction in surface air temperatures, and is based on reduction
of emissions to zero by 2300 CE. The likelihood of such aggressive mitigation is not
known, but the available data appear to “...support the conclusion that temperatures
would decrease only very slowly (if at all), even from strong reductions or
complete elimination of CO2 emissions...” (page 1104 of ref. 3). On this basis, we
propagate our environmental forcings beyond 2300 CE at their 2300 CE perturbed
level. In most cases this is their maximum, although in RCP 2.6 maxima are
reached by 2100 CE, after which they decline.
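
Holding the forcing at its 2300 CE level for the remainder of a run amounts to a simple lookup, sketched below. The function name, the dictionary layout and the numerical values in the usage example are hypothetical and serve only to show the behaviour.

```python
# Hold an annual forcing series at its 2300 CE value for all later years.
# The series layout (year -> anomaly) and the sample values are hypothetical.

HOLD_YEAR = 2300  # last year with its own forcing value (from the text)


def forcing_at(year, series, hold_year=HOLD_YEAR):
    """Anomaly applied in `year`; years after `hold_year` reuse its value."""
    return series[min(year, hold_year)]


if __name__ == "__main__":
    sample = {2298: 7.8, 2299: 7.9, 2300: 8.0}  # illustrative values only
    print(forcing_at(2299, sample))  # value for that year
    print(forcing_at(5000, sample))  # held at the 2300 value
```

Under a pathway whose anomaly peaks before 2300 CE and then declines, the held value is the 2300 CE level rather than the earlier maximum.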

Code availability. The Parallel Ice Sheet Model is freely available as open-source 
code from the PISM github repository (https://github.com/pism/pism). RegCM3 
is available from https://users.ictp.it/RegCNET/model.html. CMIP5 data were 
downloaded from http://climexp.knmi.nl/. Bedrock topography and ice thickness 
data are from the BEDMAP2 compilation, available at
http://www.antarctica.ac.uk//basresearch/ourresearch/az/bedmap2/. Information on surface mass
balance data is available at
http://www.projects.science.uu.nl/iceclimate/models/antarctica.php#racmo21.
Air temperature and geothermal heat flux inputs were
taken from the ALBMAP version 1 compilation and can be downloaded from
http://doi.pangaea.de/10.1594/PANGAEA.734145. 

Experimental methods. To establish differences in ice-sheet configuration arising 
under perturbed climate regimes, our initial requirement is for an accurate simu- 
lation of the present-day Antarctic ice sheet in which the grounded and floating 
portions are reproduced in a manner that is as close to observations as possible. 
Our principal guiding constraints are (1) the volume of grounded ice, (2) ground- 
ing-line positions, and (3) the pattern of surface velocities. Building on previous 
work, we implement the following three-stage spin-up procedure in order to
produce a thermally equilibrated and dynamically stable simulation of the present- 
day ice sheet, from which subsequent experiments can be restarted: 

(1) Initial 20-year smoothing run in which only the shallow-ice approximation 
is used to calculate ice flow. This allows the initial ice-sheet surface to relax slightly, 
removing any anomalies introduced in the initial data collation phase. During this 
brief run, the calving line is held fixed. 

(2) Intermediate 150,000-year run in which the ice geometry is held fixed, but 
the enthalpy field is allowed to evolve. This allows three-dimensional ice tempera- 
tures to evolve to equilibrium under the imposed initial climate conditions. 

(3) Final 25,000-year run in which full model physics are employed, including 
both shallow-ice and shallow-shelf approximations for velocity calculations,
viscoelastic bed deformation, grounding-line migration and calving.
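
The three spin-up stages listed above can be summarised as a small data structure, which also makes the total spin-up length explicit. The class and field names are hypothetical; the durations and descriptions are taken from the list, and nothing here corresponds to actual PISM options.

```python
# Summary of the three-stage spin-up as plain data.
# Names are illustrative; durations and constraints are from the text.
from dataclasses import dataclass


@dataclass(frozen=True)
class SpinUpStage:
    name: str
    years: int
    held_fixed: tuple  # model components that are not allowed to evolve
    purpose: str


SPIN_UP = (
    SpinUpStage("smoothing", 20, ("calving line",),
                "relax the initial surface using the shallow-ice approximation only"),
    SpinUpStage("thermal", 150_000, ("ice geometry",),
                "let the enthalpy field reach equilibrium with the initial climate"),
    SpinUpStage("full physics", 25_000, (),
                "evolve all boundaries with the full model physics"),
)


def total_spin_up_years(stages=SPIN_UP):
    """Total model years spent in spin-up before the experiments start."""
    return sum(stage.years for stage in stages)


if __name__ == "__main__":
    for stage in SPIN_UP:
        print(f"{stage.name}: {stage.years} years, fixed: {stage.held_fixed}")
    print(total_spin_up_years())
```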

During the 25,000-year final part of this spin-up procedure, all model bound- 
aries (calving line, grounding line, upper and lower surfaces) are free to evolve. 
Consequently, achieving an ice-sheet geometry at the end of the spin-up that is 
close to the observed present-day configuration requires careful parameterization. 
Iterative experimentation focused on manually adjusting flow enhancement fac- 
tors, basal hydrology and basal traction parameters, and calving coefficients. For 
each experiment ice thickness and surface velocities were compared to observation-based
data to gauge the degree of fit. Because we do not use inverse or
iterative methods, our fit to present-day constraints has outliers (Extended Data
Fig. 3). However, the positions of grounding lines and calving lines are well
captured, and the sea-level-equivalent ice volume is within 4% of empirically
calculated values. By allowing our tuning experiments to evolve over such long
timescales we ensure both that our results are not influenced by transient beha- 
viours and also that our parameterization is robust for multi-millennial integra- 
tions. This is essential, since to identify climate-forced perturbations in our study, 
model drift from incorrect parameterization needs to be minimized. 

From the optimal parameterization spin-up run we obtain a thermally equili- 
brated and dynamically stable present-day ice-sheet simulation that closely resem- 
bles the modern ice sheet (Extended Data Fig. 3). This simulation is the starting 
point for all of the RCP-based experiments. We then run a two-component 
experimental ensemble in which each of the environmental parameters (air temperature
T_air, precipitation P_eff, ocean temperature SST) evolves according to the
rates and magnitudes indicated in the extended CMIP5 ensemble mean trends 
as described above. One component of our experimentation involves 10-km- 
resolution runs using the full RCP forcing scenarios. The other component uses 
20-km-resolution runs to explore glaciological changes that occur in response to 
theoretical environmental perturbations. Each of the experiments runs for 
5,000 years, starting at year zero with climate and ocean forcings applied in 
calendar years through to 5000 CE. To minimize the possibility of any transient
effects arising during the start-up procedure, environmental boundary conditions 
are held constant for the first 2,000 years, but all model boundaries (calving line, 
grounding line, upper and lower surfaces) are free to evolve. Any transient adjustments
of the ice sheet occur during this period. From 2000 CE, timeseries environmental
forcings (described above) are imposed and the ice-sheet/ice-shelf
system evolves accordingly. In addition to the forcing experiments we run an 
additional control experiment in which the same 5,000-year run is made, starting 
from identical initial conditions but with no environmental perturbations applied. 
This is essential for the identification and quantification of any model drift during
the period of interest of the simulation (2000–5000 CE). Since this drift must arise
from processes other than environmental forcing we use this control experiment 
to bias-correct outputs from all other simulations. Even though all model bound- 
aries are allowed to evolve freely, model drift in our control experiment is near zero 
during the period of interest for both ice-volume and ice-sheet extent (Extended 
Data Fig. 4), so we do not consider that the bias-correction affects our results. In 
fact, we consider the low model drift to be additional confirmation that our 
parameterization is robust. On this basis we are confident that the response seen 
in the RCP simulations arises solely as a consequence of the combined influence of 
the simultaneously applied forcings (Fig. 1). 
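
One simple reading of the control-run bias correction described above is a year-by-year subtraction of the control drift from each forced run. The sketch below assumes both runs are sampled at the same times; the function name and the sample values are hypothetical, and the text does not specify the exact procedure used.

```python
# Bias-correct a forced run by removing the drift seen in the control run.
# Assumes both series share the same time axis; sample values are illustrative.


def bias_correct(forced, control):
    """Subtract control-run drift (relative to its first value) from a forced run."""
    if len(forced) != len(control):
        raise ValueError("forced and control series must have the same length")
    baseline = control[0]
    return [f - (c - baseline) for f, c in zip(forced, control)]


if __name__ == "__main__":
    forced_sle = [0.0, 1.0, 2.5]   # sea-level-equivalent change, metres
    control_sle = [0.0, 0.1, 0.2]  # unforced drift, metres
    print(bias_correct(forced_sle, control_sle))
```

When the control drift is near zero, as reported for these simulations, the corrected and uncorrected series are almost identical.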

Grounding-line sensitivity and resolution dependence. A key part of our simu- 
lations is the migration of ice-sheet grounding lines in response to applied envir- 
onmental forcings. In theory, to obtain the most accurate predictions of 
grounding-line locations a numerical scheme is required that is able to solve the 
full set of Stokes equations for ice flow on a high-resolution (hundreds to thousands
of metres) spatial grid. For continental-scale simulations, however, such an
approach presents large challenges, and therefore various alternative schemes
exist. These alternatives attempt to capture the critical aspects of grounding-line
behaviour while still remaining tractable over large domains (thousands of
square kilometres) and long (multi-millennial) model integrations. Of primary 
concern is whether the grid size over which the scheme is implemented actually 
influences the model results. Here we use two new grounding-line components 
that together allow dynamic grounding-line behaviour even with our relatively 
coarse (10 km) grid, but which may also be regarded as sources of uncertainty in 
our future ice-volume projections. 

The first component is a sub-grid grounding-line scheme that interpolates
basal shear stress (and, optionally, basal melt fields) at the grounding line in order 
to facilitate the smoother migration of the grounding line. This approach attempts 
to capture the effects of lateral contact between ice and ocean as well as the effects 
of water intrusion beneath the ice sheet. Furthermore, it is logical that a retreating 
grounding line must be provided with an (oceanic) basal melt value at the sub-grid 
cell into which it migrates, rather than a basal melt value typical of the grounded 
ice that characterizes the uninterpolated cell. Whether or not the grounding line 
would migrate without this interpolation, however, remains a source of uncer- 
tainty. Thus in our simulations we present both ‘high’- and ‘low’-scenario esti- 
mates of sea-level-equivalent ice loss based on turning on or off the sub-grid melt 
interpolation. In other studies that use this scheme, some evidence of resolution
dependency is apparent, but whether this is a consequence of the sub-grid scheme 
or of some other aspect of the model implementation is not clear. 
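
The idea behind sub-grid interpolation at the grounding line can be illustrated in one dimension. The sketch below locates the grounding line between a grounded and a floating cell by linearly interpolating the thickness above flotation, then scales the basal shear stress by the grounded fraction. It is a simplified illustration, not the PISM scheme itself; the densities, function names and sample geometry are assumptions.

```python
# One-dimensional illustration of sub-grid grounding-line interpolation.
# Densities, names and the linear-interpolation choice are assumptions.

RHO_ICE = 910.0        # kg m^-3, assumed
RHO_SEAWATER = 1028.0  # kg m^-3, assumed


def flotation_excess(thickness_m, bed_elevation_m, sea_level_m=0.0):
    """Ice thickness above flotation (m): positive if grounded, negative if floating."""
    water_depth = max(sea_level_m - bed_elevation_m, 0.0)
    return thickness_m - (RHO_SEAWATER / RHO_ICE) * water_depth


def grounded_fraction(excess_grounded, excess_floating):
    """Grounded share of the interval between a grounded and a floating cell centre."""
    if excess_grounded <= 0.0 or excess_floating >= 0.0:
        raise ValueError("expected one grounded (>0) and one floating (<0) cell")
    return excess_grounded / (excess_grounded - excess_floating)


def interpolated_basal_stress(tau_grounded, fraction):
    """Basal shear stress scaled by the grounded fraction (floating ice has none)."""
    return fraction * tau_grounded


if __name__ == "__main__":
    upstream = flotation_excess(thickness_m=1000.0, bed_elevation_m=-500.0)
    downstream = flotation_excess(thickness_m=500.0, bed_elevation_m=-500.0)
    lam = grounded_fraction(upstream, downstream)
    print(upstream, downstream, lam)
    print(interpolated_basal_stress(100e3, lam))  # Pa, illustrative value
```

The same grounded fraction could be used to weight an oceanic basal melt rate in the partially floating cell, which is the optional melt interpolation that distinguishes the ‘high’ from the ‘low’ simulations.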

The second component of the grounding-line scheme we adopt is an heuristic 
scheme that reduces basal traction in the first cell upstream of the grounding line. 
Smoothing the basal friction gradient across the grounding line should reduce the 
grid-dependency problems seen in artificial scenarios such as MISMIP3d that
invoke large changes in basal traction between grounded and floating domains. 
However, smoothing the basal friction gradient at the grounding line may make 
this critical boundary more sensitive to environmental perturbations than in 
models where basal traction changes abruptly, and thus this also perhaps intro- 
duces an element of uncertainty in our ice-volume projections. 

Grounding lines are poorly observed in nature due to their inaccessibility, yet 
ice-shelf basal melt rates typically reach their maxima close to the grounding line, 
and may be as high as 100 m per year (ref. 55). These rates are, however, highly 
variable and thus while small environmental perturbations could result in rela- 
tively large ice-sheet responses in some areas, the consequences in other sectors 
could be more muted. Acknowledging these uncertainties, we ran a series of 
experiments at 10-km resolution in an attempt to quantify (at the continental 
scale and at the multi-millennial scale) the effects of removing individual aspects of 
the grounding-line scheme. Extended Data Fig. 6a illustrates sea-level-equivalent 
volume changes for each of these experiments, in which all other parameters were 
held constant. Using the sub-grid scheme without the basal traction scheme (green 
line in Extended Data Fig. 6a) results in a relatively minor change to ice volume 
compared to the RCP 8.5 simulation that uses the full grounding-line scheme (blue 
line), whereas using the basal traction scheme without the sub-grid scheme (purple 
line) results in a much less dynamic ice margin that responds less to applied 
environmental forcings. Using the basal traction scheme together with the sub- 
grid scheme in which basal melt is not interpolated produces an intermediate 
result (grey line). Extended Data Fig. 6e-g illustrates the effect of this change in 
terms of grounding-line migration through time. If neither the sub-grid nor basal 
traction schemes are used, the simulated ice sheet not only increases in volume 
above present-day values, but also becomes less responsive to the applied forcings 
(orange line). 

This sub-grid grounding-line scheme is clearly important, and has been shown 
to greatly improve the accuracy of grounding-line migrations compared to earlier 
PISM releases, but its implementation may still result in a certain amount of
resolution dependence under certain circumstances. To assess the uncertainties
that may arise from grid resolution alone, we implemented a series of experiments
in which we employed identical parameterizations (including sub-grid melt
interpolation) except for the horizontal grid spacing used to solve prognostic 
equations. Extended Data Fig. 6b and c illustrates the convergence of results that 
occurs when the grid is progressively refined from 80 km to 5 km. The finest of 
these runs (5 km) was conducted only to 3100 CE, owing to the large computational
overhead. In terms of final sea-level-equivalent ice volumes, there is a greater 
difference between the 40-km and 80-km runs than there is between the 20-km 
and 40-km runs, and there is no difference at all between the final state of the 
10-km and 20-km simulations. At even finer grids, the overall trajectories of ice- 
volume and grounded-ice area at 10 km resolution agree closely with those of the 
first three millennia of the 5-km-resolution simulation. The spatial variability that 
arises at these different resolutions is illustrated in Extended Data Fig. 6d for the 
2500 CE time slice, identifying that the greatest resolution-dependent uncertainty
appears to occur in the Siple Coast region of West Antarctica, but even in this area 
there is little difference between the 10-km and 5-km results after 500 years of 
environmental forcing. 

Although the 10-km and 20-km simulations converge on identical final ice 
volumes, the coarser of these two experiments actually simulates slightly greater 
ice loss, because it deviates from the ‘control’ ice volume of our simulated present- 
day ice sheet by approximately + 0.5 m sea-level equivalent. To see whether this 
offset could be reduced, we ran 20 new experiments that explored a range of stress 
balance parameterizations. We found that by making minor adjustments to both 
the shallow-ice and shallow-shelf flow enhancement factors (Extended Data 
Table 2), fully dynamic simulations of the present-day ice sheet could be produced 
(at either resolution) that are freely evolving over multi-millennial timescales but 
exhibit minimal model drift and fit equally well to present-day geometric and 
dynamic constraints at the continental scale. Although these parameterizations 
were tuned only to reproduce the geometry and dynamics of the present-day ice 
sheet, we found that similar patterns and rates of grounding-line migration were 
simulated under RCP forcing scenarios (Extended Data Fig. 6h-j). 

Together, the suite of sensitivity experiments described above illustrates that
grounding-line treatment can strongly influence predictions of both the rate of 
ice-sheet retreat, and the total volume lost for any given forcing. Although the sub- 
grid melt interpolation scheme still requires physical verification, and tends to 
favour grounding-line retreat, values from our ‘high’ simulations may still under- 
estimate ice-sheet responses that arise from retreat mechanisms or systemic feed- 
backs not included in our model. Likewise, the more conventional implementation 
of the sub-grid grounding-line scheme that neglects interpolated basal melt results in
more conservative sea-level contributions in our ‘low’ simulations, but still lower 
values could be possible if unaccounted-for stabilizing mechanisms come into 
play. Given that geological reconstructions tend to indicate a relatively high ice-sheet
sensitivity to warmer climates of the past, we find no compelling reason to
favour the conventional grounding-line implementation over the one that 
includes interpolated basal melt. Regardless of the absolute magnitudes, however, 
our simulations consistently identify a three-stage response to environmental 
warming: an initial loss or reduction in ice-shelf area, a dynamic response of 
grounded ice, and a long-term commitment to sea-level rise that continues for 
centuries to millennia beyond the initial forcing period. 

Polar amplification and long-term equilibrium response. Although there exists 
some uncertainty as to the form of the RCP projections, particularly beyond 
2300 CE, in our experiments we are concerned with the range of likely responses
to a plausible range of environmental forcings, and as such we are confident 
that our ensemble approach captures the likely breadth of Antarctic ice sheet 
response. Palaeoclimate data and models indicate that, over millennial timescales, 
polar amplification processes can lead to surface air temperatures, and ocean 
temperatures, that may be as much as double the magnitude of the global mean 
perturbation. If this is the case, then our RCP-based experiments represent a
conservative lower estimate of the likely ice-sheet response. Furthermore, because 
we do not use a coupled climate model in our simulations, we cannot fully capture 
elevation-dependent mass-balance feedbacks that might take place as the ice-sheet 
geometry evolves. Although we use an elevation-dependent lapse rate to modify air 
temperatures through the run, it is also likely that, where ice shelves and grounded 
ice in marine basins are greatly reduced in extent, circulation changes will arise 
that may bring warmer and wetter air masses further into East Antarctica. To 
assess the likely implications of even greater warming of the southern high lati- 
tudes, we therefore ran duplicate experiments of the RCP 8.5 scenario, with and 
without sub-grid melt interpolation, and allowed air and ocean temperature 
anomalies to double by 2300 CE, compared to the original RCP 8.5 experiment.
This yields surface air and ocean temperature anomaly maxima of 16 °C and
4 °C respectively, by 2300 CE and beyond. The consequence of this extreme
scenario compared to the original RCP 8.5 scenario is illustrated in Extended
Data Figs 7 and 8.

We note that although the ice-sheet geometry is visually similar between RCP 
8.5 and RCP 8.5A for the ‘high’ scenario (blue outline), the sea-level contribution is 
2.1 m greater by 5000 CE under a polar-amplified climate, and the ongoing rate of
ice mass loss is 80% higher (Extended Data Fig. 7a, b). In the ‘low’ experiments the 
difference in sea-level contribution is much greater, with the loss of the WAIS and 
a considerably higher sea-level contribution arising with amplified warming 
(8.57 m compared to 5.2 m). With polar amplification, grounding-line retreat into 
Wilkes Basin occurs under both ‘high’ and ‘low’ scenarios (Extended Data Fig. 7b, c). 
Extending the simulation to 50,000 years (Extended Data Fig. 7c, d) the full extent of 
the warming-based commitment under the ‘high’ scenario is far greater than that 
reached by 5000 CE (15.73 m sea-level equivalent compared to 11.42 m). Under the
‘low’ scenario the difference is much less (9.68 m compared to 8.57 m). 
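
The figures quoted above for the polar-amplified experiments can be restated as the share of the long-term (50,000-year) commitment that has already been realised by 5000 CE. The sketch below only rearranges the numbers given in the text; the function name is ours.

```python
# Share of the long-term sea-level commitment realised by 5000 CE in the
# polar-amplified RCP 8.5 experiments. Input values are quoted in the text.


def fraction_realised(by_5000_m, long_term_m):
    """Fraction of the long-term sea-level contribution reached by 5000 CE."""
    return by_5000_m / long_term_m


if __name__ == "__main__":
    high = fraction_realised(11.42, 15.73)
    low = fraction_realised(8.57, 9.68)
    print(f"high: {high:.1%} realised, {15.73 - 11.42:.2f} m still to come")
    print(f"low:  {low:.1%} realised, {9.68 - 8.57:.2f} m still to come")
```

On these numbers roughly 73% of the ‘high’ commitment and roughly 89% of the ‘low’ commitment has occurred by 5000 CE, leaving about 4.3 m and 1.1 m respectively for later millennia.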
Extended Data Figure 1 | CMIP5 multi-model ensemble mean anomaly timeseries data. Air temperature (a–d), precipitation (e–h) and ocean temperature
(i–l) changes for four RCP scenarios expressed as perturbations from present, both for hemispheric sectors and for the global mean.
Extended Data Figure 2 | Long-term RCP temperature scenarios.
Antarctic-specific (60°–90° S) projected temperature trends to 2300 CE based
on CMIP5 values at 2100 CE and extended to 2300 CE following trajectories
of global means from intermediate-complexity Earth system models.
Precipitation and ocean temperature trends are calculated to follow those of
atmospheric temperatures, with magnitudes based on analysis of the
CMIP5 data set indicating a 5.3% increase in precipitation per degree air
temperature increase and a ratio of 0.25 for converting atmospheric to oceanic
temperature changes. dT, change in air temperature.
Extended Data Figure 3 | Model spin-up and fit to present-day. Ice-sheet
geometry and surface velocities (m a⁻¹, metres per year) at the end of a 25,000-year
evolutionary simulation. a and b, Observation-based (a) and
modelled (b) ice-sheet extent and surface elevations. c, Comparison of ice
thicknesses shown in a and b. d and e, Measured (d) and modelled (e) surface
ice velocities. f, Comparison of the ice velocities shown in d and e. ‘MeASURES’
is the name of the published ice velocity dataset of ref. 59.
Extended Data Figure 4 | Multi-millennial changes in ice-sheet volume and
area. Simulated changes in ice-sheet volume (a) and ice-sheet area (b) under
single-parameter and combined forcings (ΔT_air, ΔP_eff and ΔSST), based on
simplified RCP scenarios for 100-year and 300-year forcing periods. Also
shown in both panels is the control experiment (thick blue lines), illustrating
little to no drift during the period of interest to 5000 CE.
Extended Data Figure 5 | Multi-millennial changes in ice-sheet response to
single-parameter environmental forcings. Bias-corrected rates of sea-level-equivalent
ice-mass change (s.l.e. mm a⁻¹, millimetres of sea level equivalent per
year) for each of the single-parameter simplified RCP forcing experiments.
a, Rates of change under applied air temperature forcings peak at around
2240–2330 CE for both the 100-year and 300-year forcing experiments and decline
thereafter, but both the 100-year and 300-year experiments exhibit rates that
are still much larger than initial values by 5000 CE. b, Ice-mass rates of change as
in a but forced only with precipitation changes. Maxima for the 100-year and
300-year forcing experiments occur close to the end of the forcing period,
reflecting little inherent lag. c, Rates of change in response to ocean forcing
show much more elevated initial peaks compared to mass-loss rates in
subsequent millennia. By the end of the run, rates of mass loss for both the
100-year and 300-year forcing experiments are still higher than at the beginning
of the run. Data are shown relative to zero at 2000 CE.
Extended Data Figure 6 | Model sensitivity and uncertainties. a, Modelled
ice volume changes relative to the control run (in sea-level equivalent) for
simulations in which only the grounding-line parameterization is altered. ‘SG’
and ‘Slip’ denote respectively the sub-grid and reduced traction grounding-line
schemes employed in our simulations; ‘no SGmelt’ indicates an experiment
in which only the sub-grid basal melt interpolation scheme is turned off (see
Methods for details). Grey shading denotes period of applied forcing. b and
c, Domain-integrated grounded (gr.) ice area (b) and sea-level-equivalent ice
volume (c) trajectories under RCP 8.5 conditions for simulations in which the
full sub-grid scheme is used and only the model resolution is changed. The
5-km simulation was not run beyond 3100 CE owing to the large computational
overhead. d, Grounding-line positions at 2500 CE for the experiments
shown in b and c. The greatest differences occur in the Siple Coast area (SC).
Note the very close agreement between the 10-km and 5-km simulations.
e–g, Grounding-line locations under RCP 8.5 conditions at 2100 CE (e), 2300 CE
(f) and 5000 CE (g) for experiments using the full grounding-line parameterization
(‘SG+Slip’) compared to those in which the sub-grid basal melt
interpolation is turned off (‘no SGmelt’). h–j, Grounding-line locations
under RCP 8.5 conditions at 2100 CE (h), 2300 CE (i) and 5000 CE (j) for 10-km
and 20-km simulations that use the full grounding-line parameterization
and resolution-specific stress balance tunings.


©2015 Macmillan Publishers Limited. All rights reserved 




Extended Data Figure 7 | The effect of polar amplification. a, Geometry
of the modelled Antarctic ice sheets under RCP 8.5 at 5000 CE, using both
variants of the grounding-line scheme. Bold values and those in italics denote
magnitudes and rates of sea-level contributions respectively. Leading values
and those in parentheses relate to ‘low’ and ‘high’ scenarios respectively. Panels
show ice extent for ‘low’ simulations; blue lines show grounding-line locations
for ‘high’ simulations. Pale blue shading shows grounded ice lost in ‘high’
simulations but present in the ‘low’ scenario. b, Duplicate simulation to a but
using a 2× amplification of Antarctic temperatures beyond 2300 CE. Note the
greater sea-level contribution compared to a. c, The full equilibrium response of
the polar amplification scenario shown in b. Note the greater loss of ice from
the Wilkes Basin (WB) and eastern Weddell Sea (WS), resulting in a higher
total sea-level contribution. Black areas denote ice-free land. d, Rate of ice loss
for the 2× amplification scenario for ‘low’ (black) and ‘high’ (blue) scenarios,
illustrating that although the fastest contribution to sea level (2–4 m per
century) occurs during the first millennium, slower mass loss continues for
many millennia thereafter.
Extended Data Figure 8 | Antarctic contribution to GMSL. a, Predicted sea-level
contribution from the AIS for ‘high’ and ‘low’ simulations (coloured lines)
under each of the four RCP scenarios as well as one that includes
2× amplification of Antarctic temperatures by 2300 CE (darker shading), based
on coeval climatic and oceanic perturbations. The forced response (grey
shading) represents 20% to 42% of the committed response by 5000 CE. Lighter
shading between coloured lines shows rates of sea-level-equivalent ice loss
for each scenario. b, Long-term sea-level commitment as a function of
atmospheric warming (blue shading with squares). Intermediate response
curves for the ‘low’ simulations are shown in dotted lines. Red shading with
triangles shows relationship between ice-shelf area and atmospheric warming
for the near-equilibrium response and for intermediate stages (dotted lines).
All curves in b are based on data from the four RCP scenario simulations, as
well as one that includes 2× amplification of Antarctic temperatures by
2300 CE, and two additional experiments whose maximum air temperature
forcings are 1.5 °C and 3.35 °C. Pink shading defines the temperature range
within which an ice-shelf extent less than 50% of present is simulated.
Extended Data Table 1 | CMIP5 multi-model ensemble mean 
environmental perturbations 


air temp. anomaly 
(°C) 

precip. anomaly 

(%) 

ocean temp. anomaly 
(°C) 


Global 
Antarctic 
Global 
Antarctic 
Global 
Antarctic 


RCP 2.6 


RCP 4.5 RCP6.0 RCP 8.5 


2.0 2.6 4.7 
1.6 2.3 4.2 
10.9 13.2 22.6 
9.4 13.4 24.5 
wa 1.4 2.6 
0.4 0.5 1.0 


Air temperature, precipitation and ocean temperature changes (ΔT_air, ΔP_eff and ΔSST) by 2100 for each 
of the RCP scenarios, expressed as perturbations from present. 
Extended Data Table 2 | Parameter values found to best reproduce 
present-day Antarctic ice-sheet configuration 

Parameter                        Value          Units 
Domain resolution (x, y)         20 or 10       km 
Ice grid resolution (z)          0.024          km 
Bedrock grid resolution (z)      0.1            km 
Air temperature lapse rate       -0.008         K m⁻¹ 
SIA enhancement                  1.2 or 1.47    - 
SSA enhancement                  0.5 or 0.57    - 
Till porewater fraction          0.8            - 
Eigen calving coefficient        5e+17          - 
Thickness calving limit          220            m 
Density of lithosphere           3300           kg m⁻³ 
Flexural rigidity                5 × 10²⁴       N m 
Viscosity of mantle              1 × 10²¹       Pa s 
Note that different values of enhancement of shallow-ice and shallow-shelf approximations (SIA and 
SSA) are used for the 10-km- and 20-km-resolution simulations. 
Inequality and visibility of wealth in experimental social networks 


Akihiro Nishi, Hirokazu Shirado, David G. Rand & Nicholas A. Christakis 


Humans prefer relatively equal distributions of resources, yet 
societies have varying degrees of economic inequality. To investigate 
some of the possible determinants and consequences of 
inequality, here we perform experiments involving a networked 
public goods game in which subjects interact and gain or lose wealth. 
Subjects (n = 1,462) were randomly assigned to have higher or lower 
initial endowments, and were embedded within social networks 
with three levels of economic inequality (Gini coefficient = 0.0, 
0.2, and 0.4). In addition, we manipulated the visibility of the wealth 
of network neighbours. We show that wealth visibility facilitates 
the downstream consequences of initial inequality—in initially 
more unequal situations, wealth visibility leads to greater inequality 
than when wealth is invisible. This result reflects a heterogeneous 
response to visibility in richer versus poorer subjects. We also find 
that making wealth visible has adverse welfare consequences, yield- 
ing lower levels of overall cooperation, inter-connectedness, and 
wealth. High initial levels of economic inequality alone, however, 
have relatively few deleterious welfare effects. 

The unequal distribution of wealth in modern societies probably 
arose after we abandoned the relatively possession-free existence of 
hunter-gatherers, and it reflects several processes: individual variation 
in inborn traits (such as abilities and desires), differential access to 
environmental resources, and differential accumulation of wealth 
through transactions. Despite such inequality, humans have strong 
egalitarian preferences. What forces, then, lead to the emergence 
and maintenance of economic inequality? And what are the welfare 
implications of this inequality? We shed light on these questions using 
laboratory experiments that explore macro-level dynamics of economic 
inequality arising from micro-level cooperative interactions 
of individuals embedded within dynamic social networks. We 
focus on two dimensions: (1) initial conditions of wealth inequality 
(as a proxy for variation in initial endowments or private access to 
environmental resources), and (2) the local visibility of wealth. 

We carried out a series of experiments with 1,462 subjects, divided 
among 80 sessions lasting an average of 30.0 minutes (s.d. = 7.13). 
Subjects were placed in groups with an average size of 17.21 
(s.d. = 2.79) and arranged in a social network with an Erdős–Rényi 
random graph configuration in which 30% of ties were present (see 
Supplementary Information); subjects were therefore initially 
connected to an average of 5.33 (s.d. = 0.98) neighbours. The subjects 
played a cooperation game lasting 10 rounds with their neighbours. In 
each round, all subjects chose whether to cooperate, by reducing their 
own wealth by 50 ‘units’ per neighbour in order to increase the wealth 
of all neighbours by 100 units each, or to defect by paying no cost and 
providing no benefits. Subjects made the same choice with all their 
neighbours. These interactions constituted the economic transactions, 
affecting each individual’s wealth and thus resulting in population- 
level changes in overall wealth and inequality. The arbitrary units were 
converted to real money at the end of the game (see Supplementary 
Information). 
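
The set-up described above can be sketched in a few lines of code. This is a minimal illustration, not the authors' experimental software: the function names `erdos_renyi` and `play_round` are hypothetical, the group size of 17 and the 50% cooperation rate are arbitrary example values, and only the tie density (30%), the cost (50 units per neighbour) and the benefit (100 units per neighbour) come from the text.

```python
import random

def erdos_renyi(n, p, rng):
    """Undirected random graph as {node: set(neighbours)}; each pair is tied with probability p."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p:
                adj[i].add(j)
                adj[j].add(i)
    return adj

def play_round(adj, wealth, cooperates, cost=50, benefit=100):
    """One round: each cooperator pays `cost` per neighbour and every neighbour gains `benefit`."""
    new_wealth = dict(wealth)
    for i, neighbours in adj.items():
        if cooperates[i]:
            new_wealth[i] -= cost * len(neighbours)
            for j in neighbours:
                new_wealth[j] += benefit
    return new_wealth

rng = random.Random(0)                      # fixed seed so the example is repeatable
adj = erdos_renyi(17, 0.3, rng)             # 17 subjects, 30% of ties present
wealth = {i: 500 for i in adj}              # 'no initial inequality' endowment
cooperates = {i: rng.random() < 0.5 for i in adj}   # arbitrary example choices
after = play_round(adj, wealth, cooperates)
```

Because every act of cooperation costs 50 units and creates 100, each cooperator adds a net 50 units per neighbour to the group total, which is why cooperation is a public good here.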


After making their cooperation choice, subjects were informed 
of the choices made by their neighbours. Then, subjects had the 
opportunity to change their neighbours by making or breaking ties. 
Specifically, 30% of all pairs of subjects were chosen at random in each 
round and given the opportunity to rewire their networks (this set-up 
was fixed across all manipulated conditions). If a tie already existed 
between the two subjects, then one of the two was picked at random to 
be allowed to choose whether to voluntarily break the tie with the 
other; if a tie did not already exist between the two, both of them were 
given the option to form a tie and, if both approved, a new tie was 
formed. When making this decision, subjects were aware of whether 
the person to whom they might disconnect or connect had cooperated 
or defected in the past round. Thus, people could choose to alter a new 
subset of their social ties at each round; the network could be rewired; 
and subjects’ network degree (number of directly connected neigh- 
bours) and transitivity (the probability that any two of a focal subject’s 
neighbours are themselves connected) could change. 
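
The rewiring rule can be sketched as follows, under stated assumptions. The function `rewire` and the decision callbacks `wants_break` and `wants_form` are hypothetical names introduced for illustration; the example policy (drop defectors, accept cooperators) is an assumption and not a description of how subjects actually behaved. Only the 30% sampling of pairs, the unilateral breaking of ties and the bilateral formation of ties come from the text.

```python
import itertools
import random

def rewire(adj, wants_break, wants_form, rng, fraction=0.3):
    """One rewiring step on an undirected graph {node: set(neighbours)}, modified in place.

    wants_break(i, j) / wants_form(i, j) are hypothetical callbacks returning True
    if subject i wants to drop / add the tie with subject j.
    """
    pairs = list(itertools.combinations(sorted(adj), 2))
    chosen = rng.sample(pairs, round(fraction * len(pairs)))
    for i, j in chosen:
        if j in adj[i]:
            # Existing tie: one randomly picked endpoint decides unilaterally.
            decider, other = (i, j) if rng.random() < 0.5 else (j, i)
            if wants_break(decider, other):
                adj[i].discard(j)
                adj[j].discard(i)
        elif wants_form(i, j) and wants_form(j, i):
            # Missing tie: formed only if both subjects approve.
            adj[i].add(j)
            adj[j].add(i)
    return adj

# Example policy (an assumption): break with defectors, connect to cooperators.
cooperated = {0: True, 1: True, 2: False, 3: False}
network = {0: {2}, 1: set(), 2: {0, 3}, 3: {2}}
rewire(network,
       wants_break=lambda i, j: not cooperated[j],
       wants_form=lambda i, j: cooperated[j],
       rng=random.Random(0),
       fraction=1.0)
```

With every pair sampled (`fraction=1.0`) and this policy, all cooperators end up tied to one another and no tie between two defectors survives, which illustrates how rewiring can make cooperators more attractive neighbours.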

Within this basic setup, we then manipulated initial wealth inequality 
and wealth visibility (Extended Data Table 1 and Extended Data Figs 1 
and 2). To manipulate initial wealth inequality, subjects were randomly 
assigned to one of three conditions. In the ‘no initial inequality’ con- 
dition, each subject started with the same initial endowment of 
500 units. In the other two conditions, there was initial wealth inequal- 
ity, such that ‘rich’ subjects received a larger initial endowment than 
‘poor’ subjects. The endowments of the rich and poor were set to different 
levels of inequality such that the expected Gini coefficient (see 
Supplementary Information) at the beginning was either 0.0 (no initial 
inequality), 0.2 (low initial inequality), or 0.4 (high initial inequality). 
Importantly, the overall per capita initial wealth in all groups was equi- 
valent (that is, 500 units); only the distribution of wealth varied. Subjects 
were randomly assigned to be rich or poor within the low and high 
initial inequality conditions, and they were randomly assigned to one of 
the nodes in the randomly generated network regardless of their endow- 
ment (see Fig. 1 for illustration, and also Supplementary Video 1). 
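
To make the three inequality levels concrete, the sketch below computes the Gini coefficient from the mean absolute difference between all pairs of endowments. The endowment values used for the ‘low’ and ‘high’ groups (700/300 and 900/100, split evenly) are hypothetical numbers chosen so that the mean stays at 500 units and the Gini coefficient comes out at 0.2 and 0.4; the endowments actually used in the experiments are not stated in this text.

```python
def gini(values):
    """Gini coefficient: sum over all ordered pairs of |xi - xj|, divided by 2 * n^2 * mean."""
    n = len(values)
    mean = sum(values) / n
    total_diff = sum(abs(a - b) for a in values for b in values)
    return total_diff / (2 * n * n * mean)

# Hypothetical groups of 10 subjects, each with a per capita endowment of 500 units.
equal = [500] * 10
low = [700] * 5 + [300] * 5      # assumed values, chosen to give Gini = 0.2
high = [900] * 5 + [100] * 5     # assumed values, chosen to give Gini = 0.4

for name, group in [("none", equal), ("low", low), ("high", high)]:
    print(name, sum(group) / len(group), round(gini(group), 3))
```

For a population split evenly between two wealth levels, the same formula reduces to one quarter of the rich-poor gap divided by the mean, so widening the gap while holding the mean fixed raises the Gini coefficient proportionally.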

Independent of baseline inequality, we also manipulated the visibil- 
ity of local neighbours’ wealth. In the ‘invisible’ (private) condition, 
subjects only knew their own accumulated wealth. In the ‘visible’ 
condition, subjects could see their own accumulated wealth as well 
as the accumulated wealth of each of their directly connected neigh- 
bours. Subjects were informed whether each of their neighbours coop- 
erated or not, regardless of the visibility condition of neighbours’ 
wealth. In both the visible and invisible set-ups, subjects had only local 
knowledge about their immediate neighbours and not global know- 
ledge about the whole network. 

Initial wealth inequality and visibility had joint and several effects on 
game dynamics. We begin by considering the persistence of wealth 
inequality (Fig. 2). Although the Gini coefficients in the invisible 
conditions converge at a low level (of roughly 0.16) by the later rounds, the 
Gini coefficients in the visible conditions vary persistently and depend 
on the initial level of inequality. We see a substantial interaction effect 
between visibility and initial inequality on Gini over all rounds (two-way 
interaction P = 0.043; all P values determined using regression 
with standard errors clustered at the level of session and round; see 
Supplementary Table 2 and Supplementary Information for details). 
In the high initial inequality condition, making neighbours’ wealth 
visible results in significantly higher levels of inequality compared to 
when neighbours’ wealth is invisible (difference in final round 
Gini = 0.104, P = 0.004) (Fig. 2, inset). In the low initial inequality 
condition, visibility again results in significantly higher inequality 
compared to the invisible condition, but to a lesser degree (difference 
in final round Gini = 0.0387, P = 0.041). Conversely, in the no initial 
inequality condition, visibility does not affect inequality (difference in 
final round Gini = 0.0185, P = 0.450). Thus, visibility serves to facilitate 
the persistence of whatever relative level of wealth inequality is 
initially present in the system, compared to what would have happened 
without visibility. 

Figure 1 | The level of initial economic inequality and the visibility of 
connecting neighbours’ wealth information are experimentally 
manipulated in dynamic human social networks. a-d, The states at round 0 
(before interactions start) and at round 10 (after they end) in 4 out of 
80 sessions (n = 1,462) are shown. The bold outline frame of a circular node 
indicates the ‘visible’ condition (wealth information is revealed to directly 
connected neighbours) and a non-bold outline frame indicates the ‘invisible’ 
condition (not revealed). Node size (area) indicates wealth (with bigger nodes 
being richer). The letter in the node denotes the initial wealth to which subjects 
were randomly assigned: N is an initially non-poor/non-rich subject (in the 
no initial inequality condition), P is an initially poor subject, and R is an initially 
rich subject. Node colours represent the last move (purple, cooperate; red, 
defect; grey, no history). The Gini coefficient is also indicated (higher is more 
unequal). The Gini coefficient (for disposable income) is presently roughly 0.26 
in Scandinavia and 0.39 in the United States. One of the three treatment 
conditions in our experiments (low initial inequality) is not shown. 

Figure 2 | Wealth visibility increases economic inequality (relative to 
invisibility) in the presence of initial inequality, but not in the presence of 
initial equality. The dynamics of the Gini coefficient in each of six settings 
(80 sessions total) is shown. Inset, the differences between the Gini coefficient 
over the ten rounds in the visible compared to the invisible condition (in the 
form of regression coefficients; see Supplementary Information). Error bars, 
mean ± s.e.m. NS for P ≥ 0.05, *P < 0.05, and **P < 0.01. 

Examining groups of initially rich and poor subjects separately, we 
find that those individuals who are initially rich tend to be rich at the 
end, and, similarly, those who are initially poor tend to be poor at 
the end, regardless of whether the initial Gini coefficient is 0.2 or 0.4 
(Extended Data Fig. 3). Although—in both the visible and invisible 
conditions—wealth distributions of initially rich and poor subjects 
gradually overlapped as the level of earned wealth increases in later 
rounds, few reversals of fortune occurred at the individual level (as also 
seen in labour markets). 

Turning to levels of average population wealth, we find that visibility 
has a substantial negative effect (Fig. 3a and Supplementary Table 4): 
despite the same payoffs and rules across conditions, overall wealth is 
significantly lower in the visible conditions compared to the invisible 
conditions (regression model coefficient = −489.6, P = 0.001). The 
level of initial inequality is also negatively associated with overall 
wealth (coefficient = −669.6, P = 0.019). 

To further understand how visibility and inequality affect social 
welfare, we also examined cooperation and social tie formation. We 
find that the negative effect of visibility upon wealth accumulation is 
driven by a combination of two factors. First, cooperation rates are 
lower in the visible condition than in the invisible condition (difference in 
cooperation frequency = −0.208, P < 0.001; Fig. 3b and Supplementary 
Table 4), and do not differ by initial inequality: a hypothetical change 
in the Gini coefficient from 0 to 1 is associated with a difference in 
cooperation frequency of −0.084 (P = 0.445). 
Second, there is lower social connectivity in the visible condition 
than in the invisible condition (difference in average degree = −0.991, 
P = 0.012; Fig. 3c, Supplementary Table 4 and Extended Data Fig. 4), 
and there is no difference based on the initial inequality: a 
hypothetical change in the Gini coefficient from 0 to 1 is associated 
with a difference in degree of −1.71 (P = 0.167). Visibility also seems to 
affect transitivity (difference in average transitivity = −0.096, 
P < 0.001; Fig. 3d); however, after accounting for the rise in degree 
across rounds, and across treatments, neither visibility nor initial 
inequality affects transitivity (Extended Data Fig. 5b). 

Figure 3 | Visibility of wealth undermines social welfare. a-d, Changes in 
average wealth (a), cooperation rate (b), network degree (number of connecting 
neighbours) (c), and network transitivity (probability of a focal subject’s 
two neighbours being connected) (d), across rounds are shown (80 sessions 
total). Legend: initial inequality (none, low, high); wealth visibility 
(invisible, visible). Error bars, mean ± s.e.m. 

15 OCTOBER 2015 | VOL 526 | NATURE | 427 

As average wealth (and overall group wealth) is roughly the mul- 
tiplicative consequence of the cooperation rate and number of con- 
necting neighbours, the dynamics of average wealth (Fig. 3a) can be 
explained by these dynamics of cooperation and degree. When we 
additionally explored these findings at the individual level, we found 
that subjects have a larger degree in the invisible condition as a con- 
sequence of there being a larger proportion of attractive neighbours 
(that is, cooperators at the last move) available in the social network 
(Supplementary Table 8). That is, visibility reduces cooperation which 
in turn reduces the appeal of social connections. 
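
The multiplicative relationship can be made explicit with a short back-of-the-envelope calculation. The notation is introduced here for illustration and does not appear in the original: $N$ is the group size, $C_t$ the set of cooperators in round $t$, $k_{i,t}$ the degree of subject $i$, $p_t$ the cooperation rate, $\bar{k}_t$ the average degree and $\bar{w}_t$ the average wealth. The final approximation assumes that cooperation is roughly independent of degree, which is a simplification rather than a result of the paper.

```latex
% Each cooperator pays 50 units per neighbour and each neighbour receives 100,
% so every cooperative act adds a net 100 - 50 = 50 units to total group wealth.
\[
  \bar{w}_{t+1} - \bar{w}_{t}
  = \frac{1}{N} \sum_{i \in C_t} (100 - 50)\, k_{i,t}
  \approx 50 \, p_t \, \bar{k}_t .
\]
% Example: with p_t = 0.5 and \bar{k}_t = 5, average wealth grows by about
% 50 x 0.5 x 5 = 125 units in that round.
```

Under this approximation, a drop in either the cooperation rate or the average degree lowers the per-round growth of average wealth in direct proportion.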

Although the dynamics of cooperation and degree can together 
explain the rise in average wealth, the macro-level dynamics of eco- 
nomic inequality that we observe in Fig. 2 require more micro-level 
analysis to fully explain. That is, the cooperation behaviour observed in 
Fig. 3b, when multiplied by the number of social connections shown in 
Fig. 3c, can explain the wealth shown in Fig. 3a, but it cannot explain 
the inequality shown in Fig. 2. Hence, to understand the dynamics of 
inequality, we examined how subjects exhibit different behavioural 
patterns of cooperation depending on their own wealth and on the 
average wealth of their neighbours, providing an individual-level 
understanding of the effect of visibility on inequality dynamics. 

Figure 4 shows important heterogeneity in individual-level behaviours. 
When neighbours’ wealth is visible, the level of initial inequality 
has a noticeable effect on how a subject’s relative wealth affects the 
subject’s cooperation (Fig. 4, right). In the high initial inequality 
condition, subjects who are locally and presently (that is, in the current 
round) richer than the average of their neighbours are less likely to 
cooperate compared to those who are locally and presently poorer 
(regression model coefficient = −0.633, P < 0.001). Moreover, we 
observe that this tendency is driven largely by richer-than-average 
subjects who defected in the prior round (coefficient = −0.997, P < 0.001; 
Extended Data Fig. 6b); in an initially unequal world, we observe that 
defectors who are presently richer than connecting neighbours keep 
defecting and tend not to change their behaviour. This leads to an 
‘exploitation’ scenario: poorer subjects are more likely to cooperate 
and invest in their local network, making them worse off relative to their 
neighbours and allowing the rich to get richer. As a result, richer subjects 
outperform poorer neighbours, leading to the increase in economic 
inequality (relative to the invisible condition) observed at the macro level. 

Figure 4 | Wealth visibility leads to exploitation under initial inequality, but 
fairness under initial equality. When wealth is visible (right), subjects richer 
than the average of their neighbours are more likely to cooperate in the ‘no 
initial inequality’ condition (blue, fairness scenario), but less likely to cooperate 
in the high initial inequality condition (orange, exploitation scenario). This 
behavioural pattern is not observed when connecting neighbours’ wealth 
information is invisible (left). Shown are beta coefficients from logit models. 
P values indicate whether the coefficients are statistically different from 0.0. 
Error bars, point estimate ± standard error. NS for P ≥ 0.05, *P < 0.05, 
***P < 0.001. See Supplementary Information for details. 

Conversely, in the no initial inequality condition, we observe that 
subjects who are locally and presently richer than their neighbours are 
more likely to cooperate compared to poorer subjects (regression 
model coefficient = 0.370, P = 0.027). Furthermore, this tendency is 
driven largely by richer subjects who cooperated in the prior round 
(coefficient = 0.805, P = 0.002; Extended Data Fig. 6a); in an initially 
equal world, cooperators who are presently richer than connecting 
neighbours keep cooperating. This leads to a ‘fairness’ scenario in which 
the wealth of richer subjects is invested in their local network, allowing 
poorer neighbours to gain wealth. Thus, poorer subjects have the 
opportunity to catch up, and wealth visibility does not increase eco- 
nomic inequality relative to the invisible condition. Moreover, when 
considering mean difference in wealth, rather than Gini, this fairness 
behaviour leads to an actual reduction in inequality under visibility; see 
Extended Data Fig. 5a. As such effects are detected when connecting 
neighbours’ wealth information is visible, but not when it is invisible 
(three-way interaction P = 0.004, Supplementary Table 7; P> 0.05 
for all coefficients in the invisible condition in Fig. 4, left), these indi- 
vidual-level behaviours help to explain the macro-level results of our 
experiments. Moreover, agent-based simulations show that these pat- 
terns are sufficient to reproduce the observed inequality dynamics (see 
Supplementary Information and Extended Data Figs 7, 8 and 9). 

In summary, we find that making wealth visible abets the persist- 
ence of experimentally induced inequality, compared to identical 
circumstances where wealth is invisible. We also find that visibility has 
a corrosive overall effect on our laboratory ‘societies’, reducing overall 
cooperation, interconnectedness, and wealth. Thus, our experiments 
demonstrate that wealth visibility may be an important societal force, 
negatively affecting the dynamics of wealth and inequality, as well as 
social structure and cooperation. Surprisingly, our results are quite 
different with respect to the effect of initial wealth inequality. 
Rather than inequality being an ‘enemy of cooperation’, we find, in 
this setting, that inequality alone has relatively little effect on coopera- 
tion, interconnectedness or overall wealth accumulation. Thus, it is not 
inequality per se that is so problematic, but rather visibility that 
adversely affects cooperation here, regardless of what can be seen (that 
is, regardless of whether subjects are surrounded by an initially equal 
or unequal economic distribution). 

Prior work regarding the role of inequality in contributions to public 
goods has reported mixed results, and the role of inequality in the 
evolution of cooperation has not been fully understood. Insofar as 
it is not inequality per se that affects cooperation in our experiments, 
but rather visibility, our results help shed light on these findings. Our 
results may also be relevant to norms in hunter-gatherer societies 
privileging less attachment to owned items and less ostentation. It 
is noteworthy that any (limited) wealth that is possessed in foraging 
societies is necessarily visible. Hence, it may not only be the surplus 
that arose with the agricultural revolution and fixed human settle- 
ments that contributed to inequality, but also the possibility of con- 
cealment that may be key. The mere ability to choose to conceal or 
display wealth might be relevant to how much inequality and coopera- 
tion arise in social groups. 

Various psychological mechanisms may underlie the observed behaviours. 
For example, visibility may invoke neurological and psychological 
processes related to social comparison, and visibility may 
cause subjects to perceive the situation as a competition, to think that 
their wealth signals social position, or to fear being near last place, all 
of which might reduce cooperation. Our results are also consistent with 
findings regarding pay secrecy and worker productivity. 

There are features potentially relevant to inequality that our experi- 
ments do not explore, for example: whether the initial resources are 
seen as earnings or windfalls; whether individuals producing public 
goods can earn more; whether the payoff structure, group size, net- 
work topology, or rewiring rate matter; or how peer sanctions or 
institutions (like taxation, courts or policing) affect the outcome. 
Another promising topic is the effect of allowing subjects to manipulate 
the visibility of wealth, in keeping with the theory of conspicuous 
consumption and with notions of costly signalling. These are important 
directions for future work. 

Although the results of laboratory experiments do not translate 
directly into the real world, the evidence presented here suggests that 
mechanisms that conceal personal wealth information might induce 
lower economic inequality, at least given an already high level of inequal- 
ity. Given the widespread availability of wealth information as well as 
opportunities and desires to acquire and display wealth in contemporary 
societies, however, this would clearly not be easy to do. Conversely, when 
economic inequality is low, similarity could be more publicized, though 
this might sacrifice population-level economic growth. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
Extended Data Figure 1 | Lorenz curve description of the four wealth 
distributions. We prepared four different wealth conditions (A-D). For 
example, the shaded area for condition B divided by the area of the largest 
isosceles right triangle represents the Gini coefficient for condition B (that is, 
0.2). Any points on the same dotted line achieve the same level of economic 
inequality in terms of the Gini coefficient. Condition A is equivalent to any 
other condition on the line from (0,0) to (1,1). Conditions B and C are analysed 
together since they did not yield different analytical results (see Supplementary 
Information). 
Extended Data Figure 2 | Rules in our experiments and the implied 
dynamics of the Gini coefficient and mean difference. The Gini coefficient 
is a relative measure of inequality, while the mean difference is an absolute 
measure of inequality. When we focus on a tie between two subjects (a richer 
ego and a poorer alter), there are four combinations in the choices of 
cooperation behaviours in a single round. For example, when a richer ego = C 
and a poorer alter = C (that is, when a richer ego cooperates and a poorer alter 
also cooperates), both of them pay 50 units, and obtain 100 units, in which case 
the payoff is +50 for both. As both of them get the same payoff, the mean 
difference between them does not increase or decrease (→). On the other hand, 
the difference in wealth between them becomes less important in a relative 
manner, which leads to the reduction in the Gini coefficient between them (↓). 
The behaviours of the mean difference and Gini coefficient vary for the 
four combinations. C represents cooperation, and D represents defection. GINI 
represents local Gini coefficient of the focal two individuals, and ‘Mean diff’ 
represents the mean difference in wealth of the two subjects. GINI or 
mean difference can show the following outcomes: does not change (→), 
increases (↑), increases to a greater degree (↑↑), decreases (↓), or decreases to a 
greater degree (↓↓). 
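
The four combinations described in this caption can be checked numerically. The sketch below is illustrative only: the helper names are hypothetical, the starting wealths (700 and 300) are arbitrary example values, and the ‘mean difference’ is taken here simply as the absolute wealth gap between the two subjects, which is an assumption about the caption's definition.

```python
def pair_outcome(rich, poor, rich_cooperates, poor_cooperates, cost=50, benefit=100):
    """Wealth of a richer ego and a poorer alter after one round on a single tie."""
    new_rich = rich - cost * rich_cooperates + benefit * poor_cooperates
    new_poor = poor - cost * poor_cooperates + benefit * rich_cooperates
    return new_rich, new_poor

def pair_gini(a, b):
    """Gini coefficient of a two-person population (relative inequality)."""
    return abs(a - b) / (2 * (a + b))

def pair_mean_diff(a, b):
    """Absolute wealth gap between the two subjects (assumed definition)."""
    return abs(a - b)

rich, poor = 700, 300   # arbitrary example wealths
for ego_c, alter_c in [(True, True), (True, False), (False, True), (False, False)]:
    r, p = pair_outcome(rich, poor, ego_c, alter_c)
    label = ("C" if ego_c else "D") + ("C" if alter_c else "D")
    print(label, r, p, pair_mean_diff(r, p), round(pair_gini(r, p), 3))
```

With these example values, mutual cooperation leaves the absolute gap unchanged but lowers the pairwise Gini coefficient, a cooperating richer ego facing a defecting poorer alter shrinks both measures, the reverse combination enlarges both, and mutual defection changes neither.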
Extended Data Figure 3 | A majority of initially rich individuals stay 
wealthier than a majority of initially poor individuals over the ten rounds 
regardless of the initial conditions. a, b, The mean and standard error of 
mean (bar) of the average wealth of a group of initially rich individuals and of a 
group of initially poor individuals are calculated at each round for the visible 
condition (a) and for the invisible condition (b). Error bars, mean ± s.e.m. 
c-f, For each round at each session, we standardized wealth of each individual 
(that is, (individual wealth − mean)/s.d.), and calculated the minimum (min), 
25th percentile (25th), median, 75th percentile (75th), and maximum (max) of 
the standardized wealth of a group of initially rich individuals and a group of 
initially poor individuals, separately. These figures show the trajectories of the 
mean of the five indicators (minimum, 25th, median, 75th, and maximum) 
among different sessions of the same initial condition (c for high-level initial 
inequality, visible; d for high-level initial inequality, invisible; e for low-level 
initial inequality, visible; and f for low-level initial inequality, invisible). Darker 
shades represent the mean of interquartile ranges (25th to 75th), lighter shades 
represent the mean of ranges (minimum to maximum), and the solid lines 
represent the mean of the median among the different sessions. Crossing of 
shades and medians between the two groups, if observed, implies the influence 
of the initial wealth difference on present wealth may be wiped out. 
Extended Data Figure 4 | Cumulative degree distributions at the final 
(tenth) round. The proportion of subjects who have at least k social ties 
(degree) is calculated for each k (1 to 20) at each initial condition. Each 
distribution of the three initial inequality conditions in the invisible setting is 
significantly different from that in the visible setting (Kolmogorov-Smirnov 
test, P < 0.01), and has fatter tails. The pairwise comparisons of different initial 
inequality conditions at the same neighbours’ wealth information setting 
(that is, none versus low, none versus high and low versus high) show that those 
distributions are not significantly different (Kolmogorov-Smirnov test, 
P > 0.12) except none versus high in the invisible condition (Kolmogorov-Smirnov 
test, P = 0.030). The means of these distributions, by round, are shown 
in Fig. 3c. 
Extended Data Figure 5 | Changes in mean difference in wealth and in 
excess transitivity in the experimental conditions. a, The dynamics of mean 
difference in each of six settings is shown. Inset, the differences between 
mean difference at the first to tenth rounds in the visible compared to the 
invisible condition are shown separately for three different conditions of initial 
inequality (none, blue; low, grey; high, orange). Positive bars indicate that 
making neighbours’ wealth visible increases mean difference in wealth. b, The 
dynamics of excess transitivity (transitivity adjusted for network degree at 
each session) in each of the six settings is shown. (See Fig. 3d for the dynamics of 
transitivity unadjusted for degree.) As a larger degree naturally results in a 
larger transitivity, we calculate the expected value of transitivity given a certain 
network degree and a certain size in a random graph in simulations (10,000 
iterations), and report the deviation of the observed transitivity from the 
expected transitivity (that is, observed transitivity minus expected transitivity). 
Inset, the differences between excess transitivity at the first to tenth rounds 
in the visible compared to the invisible condition are shown separately for three 
different conditions of initial inequality (none, blue; low, grey; high, orange). 
Negative bars indicate that making neighbours’ wealth visible decreases 
excess transitivity. Error bars, mean ± s.e.m. NS for P ≥ 0.05, *P < 0.05. 
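
The degree adjustment described in this caption can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: transitivity is computed here as the average, over subjects with at least two neighbours, of the fraction of neighbour pairs that are themselves tied (following the definition given in the main text), and the random-graph baseline keeps the number of nodes and ties fixed. The function names and the default of 1,000 iterations are choices made for this example; the caption reports 10,000 iterations.

```python
import random

def transitivity(adj):
    """Mean, over nodes with >= 2 neighbours, of the fraction of neighbour pairs that are tied."""
    scores = []
    for nbrs in adj.values():
        nbrs = sorted(nbrs)
        k = len(nbrs)
        if k < 2:
            continue
        links = sum(1 for a in range(k) for b in range(a + 1, k)
                    if nbrs[b] in adj[nbrs[a]])
        scores.append(links / (k * (k - 1) / 2))
    return sum(scores) / len(scores) if scores else 0.0

def random_graph(n, m, rng):
    """Random undirected graph with n nodes and exactly m ties."""
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    adj = {i: set() for i in range(n)}
    for i, j in rng.sample(pairs, m):
        adj[i].add(j)
        adj[j].add(i)
    return adj

def excess_transitivity(adj, iterations=1000, seed=0):
    """Observed transitivity minus the mean transitivity of size- and degree-matched random graphs."""
    rng = random.Random(seed)
    n = len(adj)
    m = sum(len(v) for v in adj.values()) // 2
    expected = sum(transitivity(random_graph(n, m, rng))
                   for _ in range(iterations)) / iterations
    return transitivity(adj) - expected

# Two separate triangles are far more clustered than a random graph of the same size.
two_triangles = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}, 3: {4, 5}, 4: {3, 5}, 5: {3, 4}}
print(excess_transitivity(two_triangles, iterations=200))
```

A positive value means the network is more clustered than expected for its size and number of ties, and a value near zero means the observed clustering is explained by degree alone.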
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Extended Data Figure 6 | Additional results regarding behavioural 
mechanisms. a, b, Stratified results of Fig. 4 according to the prior move 
(cooperation or defection). Bars represent standard errors. c-f, Cooperation 
rate in the different conditions with respect to the variable of social comparison 
(ego (a focal individual) is richer or alters (given as the average of an ego’s 
connecting neighbours) are richer) for each setting. For example, e shows that 
richer subjects are more likely to cooperate (82.9%) when the initial economic 
inequality is set to none in the visible condition. Panel f shows that richer 
subjects are less likely to cooperate (17.3%) when the initial economic 
inequality is set to high in the visible condition. Error bars, mean ± s.e.m. 
NS for P ≥ 0.05, **P < 0.01, ***P < 0.001. 
Extended Data Figure 8 | Agent-based simulations reproduced the results 
that were observed in the online experiments with human subjects. 
a-e, Results of agent-based simulations for Gini coefficient, mean difference, 
average wealth, proportion of cooperation, and degree (interconnectedness) are 
shown, respectively. The medians (solid and dashed lines) and 90% confidence 
regions (shaded area, 5th percentile to 95th percentile) are presented. 
Extended Data Figure 9 | Results of agent-based simulations with sessions of up to 20 rounds show that the effect of visibility on Gini dynamics is robustly 
observed. a-e, The medians and 90% confidence regions (shaded area, 5th percentile to 95th percentile) are presented. 
Extended Data Table 1 | Parameters for experimental settings 
Please refer to the Lorenz curve in Extended Data Fig. 1 for the wealth conditions A, B, C and D. 


©2015 Macmillan Publishers Limited. All rights reserved 


Level of initial economic inequality    Initial Gini    Number of games 
No                                      0.00            10 
Low                                     0.20            10 
Low                                     0.21            10 
High                                    0.41            10 
No                                      0.00            10 
Low                                     0.20            10 
Low                                     0.21            10 
High                                    0.41            10 

(Values for the initial wealth of individuals and the expected initial mean difference columns are missing from this copy.) 


LETTER 


doi:10.1038/nature15694 


Forniceal deep brain stimulation rescues hippocampal 
memory in Rett syndrome mice 


Shuang Hao, Bin Tang, Zhenyu Wu, Kerstin Ure, Yaling Sun, Huifang Tao, Yan Gao, Akash J. Patel, 
Daniel J. Curry, Rodney C. Samaco, Huda Y. Zoghbi & Jianrong Tang 


Deep brain stimulation (DBS) has improved the prospects for many 
individuals with diseases affecting motor control, and recently it has 
shown promise for improving cognitive function as well. Several 
studies in individuals with Alzheimer disease and in amnesic rats have 
demonstrated that DBS targeted to the fimbria-fornix, the region 
that appears to regulate hippocampal activity, can mitigate defects in 
hippocampus-dependent memory. Despite these promising results, 
DBS has not been tested for its ability to improve cognition in any 
childhood intellectual disability disorder. Such disorders are a press- 
ing concern: they affect as much as 3% of the population and involve 
hundreds of different genes. We proposed that stimulating the 
neural circuits that underlie learning and memory might provide 
a more promising route to treating these otherwise intractable 
disorders than seeking to adjust levels of one molecule at a time. 
We therefore studied the effects of forniceal DBS in a well-char- 
acterized mouse model of Rett syndrome (RTT), which is a leading 
cause of intellectual disability in females. Caused by mutations that 
impair the function of MeCP2 (ref. 6), RTT appears by the second 
year of life in humans, causing profound impairment in cognitive, 
motor and social skills, along with an array of neurological features. 
RTT mice, which reproduce the broad phenotype of this 
disorder, also show clear deficits in hippocampus-dependent learning 
and memory and hippocampal synaptic plasticity. Here we 
show that forniceal DBS in RTT mice rescues contextual fear mem- 
ory as well as spatial learning and memory. In parallel, forniceal 
DBS restores in vivo hippocampal long-term potentiation and 
hippocampal neurogenesis. These results indicate that forniceal 
DBS might mitigate cognitive dysfunction in RTT. 

A deficit in contextual fear memory is one of the most reproducible 
and reliable outcome measures among RTT mouse models. 
Specifically, female Mecp2+/- mice (hereafter referred to as RTT mice) 
have impaired contextual fear memory when tested 24 h after training. 
Because this deficit is readily quantifiable and accessible, we used 
fear memory as our first test of the effect of forniceal DBS in freely 
moving RTT mice. We implanted DBS electrodes in the fimbria-fornix 
(FFx) of 6- to 8-week-old RTT mice and wild-type controls 
(Extended Data Fig. 1), guiding electrode placement with FFx-evoked 
potentials in the dentate gyrus (Fig. 1a-c). We divided mice into four 
groups after recovery: wild-type, sham; wild-type, DBS; RTT, sham; 
and RTT, DBS. Mice in both DBS groups received daily DBS treatment 
for 14 days while the two sham groups experienced the same procedures 
except for DBS. Based on widely used parameters for DBS in the 
clinic and in rodents, along with our own pilot testing, we set DBS at 
130 Hz, 60 μs pulse duration, and 1 h per day. Stimulus intensities were 
individually optimized to 80% of the threshold that elicits an instance 
of afterdischarge in the hippocampus. No seizures appeared under 
these DBS parameters. Three weeks after completing the two-week 
DBS protocol, we performed behavioural testing and subjected the 
mice (now aged 14 weeks, Extended Data Fig. 1) to a fear conditioning 
paradigm to examine contextual fear memory and cued fear memory 
(see Methods). 
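
As a worked illustration of the stimulation schedule above (130 Hz, 1 h per day, 14 days, intensity at 80% of the afterdischarge threshold), the arithmetic can be sketched as follows. The constant and function names are hypothetical, and the 100 μA threshold in the usage note is an invented example value, not a figure from the study.

```python
FREQUENCY_HZ = 130          # stimulation frequency from the text
SESSION_SECONDS = 60 * 60   # 1 h per day
TREATMENT_DAYS = 14         # daily sessions
INTENSITY_FRACTION = 0.80   # 80% of the afterdischarge threshold


def pulses_per_session():
    """Number of pulses delivered in one daily session."""
    return FREQUENCY_HZ * SESSION_SECONDS


def pulses_total():
    """Number of pulses delivered over the whole treatment course."""
    return pulses_per_session() * TREATMENT_DAYS


def stimulus_intensity(threshold_ua):
    """Per-animal stimulus intensity (μA) given a measured threshold (μA)."""
    return INTENSITY_FRACTION * threshold_ua
```

Under these parameters each session delivers 468,000 pulses and the full course 6,552,000; a hypothetical animal with a 100 μA threshold would be stimulated at about 80 μA.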

Forniceal DBS significantly enhanced contextual fear memory in 
both wild-type (3 h, day 1 and day 3, P < 0.05) and RTT mice (3 h and 
day 1, P < 0.05; Fig. 1d). In fact, DBS restored contextual fear memory 
in RTT mice to wild-type levels: there was no difference between 
the DBS-treated RTT mice (3 h, 47.56 ± 4.22%; day 1, 47.84 ± 4.16%) 
and sham-treated wild-type mice (3 h, 44.87 ± 3.60%; day 1, 
45.97 ± 3.69%). Notably, forniceal DBS did not alter cued fear memory 
(Fig. 1e), even though the FFx also projects to the amygdala. All 
the mice that received DBS/sham treatment responded to tone pre- 
sentation (Extended Data Fig. 2e-h), but less than the animals that 
were implanted and did not experience the 2-week DBS/sham proce- 
dures (Extended Data Fig. 2b-d). Further analysis indicated that the 
longer period of handling and exposure (for example, daily trans- 
portation, connection/disconnection of the wires, and staying in the 
DBS/sham chamber for 1 h per day) increased the motor activity and 
decreased the anxiety levels in DBS/sham-treated mice (Extended 
Data Fig. 3). These changes likely reduced the fear responses to the 
tone, and the conditioning context, in general (Fig. 1d, e and Extended 
Data Fig. 2a, b). 

Forniceal DBS did not improve locomotion, anxiety, pain 
threshold or motor learning (Extended Data Figs 3 and 4a, b), nor did it 
improve motor coordination, social behaviour or body weight in RTT mice, 
although RTT mice differed from wild-type 
controls in these features (Extended Data Figs 4c-f and 5a, b). 
Forniceal DBS thus specifically rescued contextual memory impair- 
ment in RTT mice without evident off-target effects. 

To determine whether forniceal DBS would improve spatial cognition, 
which is also hippocampus-dependent, we trained new cohorts 
of mice, which received the same DBS/sham procedures, in a hidden 
platform version of the water maze task (Extended Data Fig. 1). 
Sham-treated RTT mice needed more time than sham-treated wild- 
type mice to locate the hidden platform across the training trials, spent 
less time in the target quadrant, and had fewer platform area crossings 
in the probe test (Fig. 2a). In wild-type mice, DBS significantly 
enhanced spatial learning compared to the sham group (Fig. 2b). 
Treatment made no difference during the probe test, probably because 
of a ceiling effect in sham-treated wild-type animals. We observed an 
even stronger effect of DBS in RTT mice: forniceal DBS enhanced not 
only spatial learning but also spatial memory retrieval (Fig. 2c). Again, 
the rescue was so strong that there was no difference between DBS- 
treated RTT and sham-treated wild-type groups in latencies to the 
hidden platform, time in target quadrant, or platform area crossings 
(Fig. 2d). Visible platform training confirmed that neither MeCP2 level 
nor forniceal DBS altered visual or sensorimotor skills (Extended Data 
Fig. 5c-e). 
Figure 1 | Forniceal DBS restores contextual fear memory in RTT mice. 

a, b, Photomicrographs illustrating DBS electrode placement (arrowheads) in 
the FFx (a) and the recording electrode in the dentate gyrus (b). cc, corpus 
callosum; LV, lateral ventricle; D3V, dorsal third ventricle; DG, dentate gyrus. 
c, Representative evoked potential trace of the FFx pathway recorded in the 
dentate. d, Forniceal DBS enhanced contextual fear memory in both wild-type 
(WT) and RTT mice (DBS-treated wild-type mice, n = 21; sham-treated wild- 
type mice, n = 21; DBS-treated RTT mice, n = 17; sham-treated RTT mice, 
n = 14). There were significant main effects on freezing time among the four 
groups (two-way repeated-measures ANOVA: group, F3,69 = 5.67, P = 0.002; 
day, F3,139 = 6.44, P < 0.001; group × day interaction, F9,139 = 2.15, 
P = 0.027). Within-genotype analysis revealed a significant DBS effect in both 
wild-type (F1,40 = 8.50, P = 0.006) and RTT mice (F1,35 = 6.44, P = 0.016). 
DBS in RTT mice restores contextual fear memory to wild-type levels (DBS-treated 
RTT mice vs. sham-treated wild-type mice: group, F1,36 = 2.76, 
P = 0.105). Comparison of contextual fear memory on day 1 among the four 
groups revealed a significant main effect (two-way ANOVA followed by 
Tukey’s post hoc test: genotype, F1,69 = 8.39, P = 0.005; treatment, 
F1,69 = 11.41, P = 0.001). e, Cued fear memory of mice tested in d. There was no 
difference in cued fear memory between groups at any time point (main 
effect: group, F3,69 = 0.88, P = 0.456; day, F3,139 = 1.65, P = 0.179; group × day 
interaction, F9,139 = 0.89, P = 0.538) or on day 1 (genotype, F1,69 = 0.64, 
P = 0.428; treatment, F1,69 = 0.11, P = 0.741). *P < 0.05; NS, not significant. 
Data presented as mean ± s.e.m. Scattergrams show individual values. 


Since the RTT mice used in this study are impaired in both 
hippocampus-dependent memory and in vitro hippocampal long-term 
potentiation (LTP), they provide an ideal setting in which to examine 
whether DBS alters synaptic plasticity. We implanted RTT and wild- 
type mice with DBS electrodes in the FFx, stimulation electrodes in 
Figure 2 | Forniceal DBS rescues spatial learning and memory in RTT mice. 
In the water maze task, all mice were trained with a hidden platform for 9 days 
followed by a probe test without the platform 24 h after the last training. 
There were significant main effects on escape latencies among the four 
groups (n = 18 mice per group) during acquisition training (two-way repeated-measures 
ANOVA: group, F3,68 = 20.74, P < 0.001; day, F8,544 = 19.72, 
P < 0.001). a, Sham-treated RTT mice showed increased escape latencies 
during training (genotype, F1,34 = 35.30, P < 0.001; day, F8,272 = 7.06, 
P < 0.001), decreased time in the target quadrant (P < 0.01) and fewer 
platform area crossings during the probe test (P < 0.001) than sham-treated 
wild-type controls. b, Forniceal DBS decreased escape latencies during training 
in DBS-treated wild-type mice compared to sham-treated wild-type 
controls (treatment, F1,34 = 5.94, P = 0.020; day, F8,272 = 17.10, P < 0.001; 
treatment × day, F8,272 = 2.19, P = 0.028). There was no difference in time 
spent in the target quadrant (P > 0.05) or in the number of platform area 
crossings (P > 0.05) between DBS and sham groups during the probe test. 
c, The DBS-treated RTT mice showed shorter escape latencies during training 
(treatment, F1,34 = 10.31, P = 0.003; day, F8,272 = 6.13, P < 0.001), more 
time in the target quadrant (P < 0.05) and more platform area crossings (P < 0.05) 
during the probe test than sham-treated RTT controls. d, There was no 
difference between DBS-treated RTT mice and sham-treated wild-type controls 
in escape latencies during training (group, F1,34 = 2.91, P = 0.097; group × day 
interaction, F8,272 = 0.80, P = 0.606), time in the target quadrant (P > 0.05), or 
number of crossings of the platform area (P > 0.05) during the probe test. 
*P < 0.05, **P < 0.01, ***P < 0.001; NS, not significant (Tukey’s post hoc test 
during acquisition; two-tailed unpaired t-test between groups and two-tailed 
paired t-test within group during probe). Data presented as mean ± s.e.m. 
Scattergrams show individual values. 


the perforant path for the LTP test, and a recording electrode in the 
dentate gyrus (using evoked potentials as a guide). As with the behavioural 
studies described above, the mice underwent 2 weeks of DBS 
followed by a 3-week interval before LTP testing (Extended Data 
Fig. 1). These DBS/sham procedures did not alter the hippocampal 
neural excitability (Extended Data Fig. 6a). LTP was induced on day 0 
and monitored across 5 days after induction in awake, freely moving 
mice. We quantified the population spike amplitude in each of the four 
groups of animals (Fig. 3a). Neither DBS nor sham treatment altered 
baseline synaptic transmission in sham-treated wild-type, sham- 
treated RTT or DBS-treated wild-type mice, but DBS slightly reduced 
the magnitude of the evoked responses in DBS-treated RTT animals 
(Extended Data Fig. 6b). There was no difference in the stimulus 
intensities used for LTP induction among the four groups (wild-type, 
sham, 83.91 ± 13.29 μA; wild type, DBS, 69.50 ± 7.58 μA; RTT, sham, 
73.50 ± 14.58 μA; RTT, DBS, 68.62 ± 13.87 μA; P > 0.05). As expected, 
sham-treated RTT mice showed impaired in vivo LTP (1 h after induction, 
160.37 ± 14.08%) compared to sham-treated wild-type controls 
(256.06 ± 27.50%) (Fig. 3b). Forniceal DBS, however, enhanced LTP 
in both wild-type mice (Fig. 3c; DBS, 381.75 ± 26.13%; sham, 
256.06 ± 27.50%) and RTT animals (Fig. 3d; DBS, 254.93 ± 22.96%; 
sham, 160.37 ± 14.08%) to the degree that there was no difference 
between DBS-treated RTT and sham-treated wild-type groups 
(Fig. 3e). Forniceal DBS thus restored hippocampal LTP in the per- 
forant path/dentate pathway in RTT mice. 

Because hippocampal neurogenesis contributes to hippocampal 
LTP and hippocampus-dependent memory, and because DBS in 
other afferent pathways of the hippocampus increases dentate neurogenesis, 
we explored whether forniceal DBS might exert its effects 
by stimulating hippocampal neurogenesis. We first observed that two-hour 
unilateral forniceal DBS stimulated the activity of dentate neurons, 
as indicated by increased expression of the immediate early gene 
Fos (Extended Data Fig. 7a). Each day after DBS over the 2 weeks of 
DBS/sham treatment, we injected RTT and wild-type mice with 
5-bromo-2’-deoxyuridine (BrdU) to mark newborn cells. We quan- 
tified dentate neurogenesis by the numbers of cells positive for BrdU, 
DCX (doublecortin, to label the immature neurons), or double labelled 
in each of the four groups (Fig. 4a-d). We found that baseline levels of 
dentate neurogenesis in sham-treated RTT mice (BrdU, 571 ± 99.84; 
DCX, 914 ± 242.76) were significantly lower than in sham-treated 
wild-type controls (BrdU, 1,218 ± 175.57; DCX, 2,059 ± 381.49) 
(Fig. 4e, f). Forniceal DBS, however, bilaterally enhanced dentate neurogenesis 
in both RTT (DBS-treated RTT: BrdU, 2,012 ± 269.24; DCX, 
3,146 ± 340.04; BrdU/DCX, 1,453 ± 187.69) and wild-type mice (DBS-treated 
wild-type: BrdU, 1,833 ± 274.79; DCX, 3,686 ± 426.92; BrdU/DCX, 
1,526 ± 257.46) (Fig. 4e-g and Extended Data Fig. 7b) such that 
levels of neurogenesis in DBS-treated RTT mice were even higher than 
those in sham-treated wild-type controls. 

Data suggest that decreased cholinergic signalling plays a role in 
RTT and that forniceal stimulation enhances hippocampal memory 
through cholinergic modulation in rodents. Therefore, we examined 
the effect of hippocampal infusion of the muscarinic acetylcholine receptor 
antagonist atropine on the DBS benefit. There was no difference in 
fear memory between atropine-treated and vehicle-treated groups in either 
RTT or wild-type mice, suggesting that DBS must benefit memory via 
additional mechanisms (Extended Data Fig. 8). 

DBS has been used to treat both motor and neuropsychiatric disorders 
in children. There are a few case reports showing that pallidal 
DBS can resolve self-injurious behaviour in Lesch-Nyhan syndrome, 
and hypothalamic DBS reduces aggressive behaviours in 
patients with intellectual disability. To our knowledge, however, DBS 
has never been examined for cognitive benefits in the context of a 
childhood intellectual disability. In this study, we demonstrate that 
DBS improves contextual memory retrieval in a mouse model of 
Rett syndrome. Using a stimulation protocol that mimics clinical 


Figure 3 | Forniceal DBS rescues hippocampal synaptic plasticity in freely 
moving RTT mice. a, Superimposed traces of the perforant path recorded in 
the dentate gyrus 5 min before (grey) and 55 min after (black or red) tetani. 
b, Sham-treated RTT mice (n = 12) showed impaired LTP compared to the 
sham-treated wild-type group (n = 11) on day 0 (two-way repeated-measures 
ANOVA: genotype, F1,21 = 11.34, P = 0.003; time, F15,315 = 40.51, P < 0.001; 
genotype × time interaction, F15,315 = 9.36, P < 0.001), day 1 (genotype, 
F1,21 = 7.46, P = 0.012; time, F3,63 = 5.15, P = 0.003), and day 2 (genotype, 
F1,21 = 6.50, P = 0.019). c, d, Forniceal DBS enhanced LTP in both wild-type 
and RTT mice (DBS-treated wild-type, n = 12; sham-treated wild-type, n = 11; 
DBS-treated RTT, n = 13; sham-treated RTT, n = 12). Two-way repeated-measures 
ANOVA revealed significant main effects on population spike 
amplitudes among the four groups on day 0 (group, F3,44 = 17.25, P < 0.001; 
time, F15,660 = 167.28, P < 0.001; group × time interaction, F45,660 = 14.50, 
P < 0.001), day 1 (group, F3,44 = 21.53, P < 0.001; time, F3,132 = 7.69, 
P < 0.001), day 2 (group, F3,44 = 16.21, P < 0.001; time, F3,132 = 8.96, 
P < 0.001), and day 5 (group, F3,44 = 8.42, P < 0.001; time, F3,132 = 8.35, 
P < 0.001). c, Forniceal DBS enhanced LTP in wild-type controls on day 0 
(treatment, F1,21 = 12.16, P = 0.002; time, F15,315 = 121.93, P < 0.001; 
treatment × time interaction, F15,315 = 10.91, P < 0.001), day 1 (treatment, 
F1,21 = 18.99, P < 0.001; time, F3,63 = 4.77, P = 0.005), day 2 (treatment, 
F1,21 = 11.25, P = 0.003; time, F3,63 = 3.72, P = 0.016) and day 5 (treatment, 
F1,21 = 9.44, P = 0.006; time, F3,63 = 6.73, P < 0.001). d, Forniceal DBS 
enhanced LTP in RTT mice on day 0 (treatment, F1,23 = 11.86, P = 0.002; time, 
F15,345 = 45.02, P < 0.001; treatment × time interaction, F15,345 = 10.31, 
P < 0.001), day 1 (treatment, F1,23 = 14.60, P < 0.001; time, F3,69 = 2.91, 
P = 0.041) and day 2 (treatment, F1,23 = 19.45, P < 0.001; time, F3,69 = 6.09, 
P < 0.001). e, There was no difference in LTP between DBS-treated RTT and 
sham-treated wild-type mice (P > 0.05 for all the test days). Arrows, LTP 
induction. *P < 0.05, **P < 0.01, ***P < 0.001; NS, not significant. Data 
presented as mean ± s.e.m. 
treatment, forniceal DBS clearly enhanced contextual fear memory as 
well as spatial learning and memory in both wild-type and RTT mice. 
DBS in the fimbria—fornix is so effective in the RTT mice that it 
restores hippocampus-dependent memory in both tasks to wild-type 
levels. We also found that forniceal DBS increases hippocampal synaptic 
plasticity and hippocampal neurogenesis, both of which are central 
to hippocampal learning and memory. 

Although this study was limited to hippocampus-based learning 
and memory, it is remarkable that DBS could exert any benefit in 
the face of such profound cognitive impairments as caused by RTT. 
Future work will explore additional DBS targets to determine the 
possible benefits of DBS on other RTT features such as dystonia and 
motor incoordination. Our studies lead us to suggest that DBS should 
be explored in other models of intellectual disabilities and eventually in 
human patients, particularly those conditions that cause more focal 
deficits in learning and memory. Intellectual disabilities as a group 
affect 2-3% of the population and are at present untreatable; their 
molecular heterogeneity poses a daunting challenge to those looking 
for viable therapies. Because DBS can be modulated and is reversible 
and safe, it is an appealing candidate treatment that could 
potentially relieve a great deal of suffering. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
Figure 4 | Forniceal DBS stimulates hippocampal neurogenesis in wild-type 
and RTT mice. After DBS/sham treatment, animals were perfused for 
immunohistochemical detection of BrdU- and 
DCX-positive cells in the dentate gyrus. 
a-d, Representative images ipsilateral to the DBS/sham 
treatment at low (top; scale bar, 100 μm) and 
high (bottom; scale bar, 50 μm) magnification, 
showing BrdU+ cells (green), DCX+ cells (red) and 
the merge (yellow) from each of the four groups. 
New neurons were located in the innermost layer of 
the dentate gyrus. e-g, Summary of immunoreactive 
cell counting (n = 6 mice per group). Two-way 
ANOVA revealed a significant main effect 
on the numbers of BrdU+ cells (e; treatment, 
F1,20 = 23.49, P < 0.001), DCX+ cells (f; genotype, 
F1,20 = 5.65, P = 0.028; treatment, F1,20 = 29.65, 
P < 0.001), and BrdU+ DCX+ double-staining 
cells (g; treatment, F1,20 = 32.99, P < 0.001). 
Tukey’s post hoc test indicated that sham-treated 
RTT mice had fewer BrdU+ (e) and DCX+ (f) cells 
than sham-treated wild-type controls. Forniceal 
DBS increased the numbers of BrdU+ (e), DCX+ 
(f) and BrdU+ DCX+ double-staining (g) cells in 
DBS-treated wild-type and RTT mice compared to 
their respective sham-treated controls. *P < 0.05, 
**P < 0.01, ***P < 0.001. Scattergrams show 
individual values. 
METHODS 

Animals. Adult female Mecp2+/- mice (14-16 weeks of age at the time of fear 
conditioning, water maze or LTP test) (Extended Data Fig. 1) on an FVB.129 
background were maintained on a 12h light:12 h dark cycle (light on at 7:00) with 
standard mouse chow and water ad libitum in our on-site AAALAS-accredited 
facility. They were group-housed up to five mice per cage before surgery and 
individually housed with nesting material in the cage after surgery in a room 
maintained at 22°C. All the experimental procedures and tests were conducted 
during the light cycle. Behavioural, electrophysiological, immunohistochemical, 
and pharmacological characterizations of the mice were performed and analysed 
blind to genotypes and/or treatments. All research and animal care procedures 
were approved by the Baylor College of Medicine Institutional Animal Care and 
Use Committee. 

Surgery and DBS. Mice were secured on a stereotaxic frame (David Kopf) under 
1-2% isoflurane anaesthesia. Bipolar DBS electrodes were constructed with 
Teflon-coated tungsten wire (bare diameter 50 μm, A-M Systems) and the two 
tips were horizontally separated by 0.1-0.15 mm. The electrodes were targeted 
unilaterally to the fimbria-fornix (0.2 mm posterior, 0.2 mm lateral, and 
2.3-2.4 mm below the bregma) under the guidance of evoked potentials recorded 
in the ipsilateral dentate gyrus (1.8-2.0 mm posterior, 1.4-1.6 mm lateral of 
bregma, 2.2-2.3 mm below the skull). All the electrodes together with the 
attached connector sockets were fixed on the skull by dental cement. Animals 
were given at least 2 weeks to recover. 

After recovery, mice were assigned into four groups (randomly to DBS or sham 
groups within the same genotype): wild-type, sham; wild-type, DBS; RTT, sham; 
and RTT, DBS. Animals in both DBS groups received 1 h DBS daily for 14 
consecutive days. Based on DBS parameters widely used in the clinic and the 
cognitive assessment of DBS in rodents, DBS consisted of biphasic rectangular 
pulses (130 Hz, 60 μs pulse duration). This DBS pattern is used both in human 
subjects and rodents. The stimulus intensities were individually optimized 
to 80% of the threshold that elicits an afterdischarge in the hippocampus. 
Animals in the sham groups experienced the same procedures as those 
in the DBS groups except without DBS. There was a 3-week interval between 
the last DBS administration and the beginning of fear conditioning, water maze, 
or in vivo LTP tests. All the mice received DBS/sham treatment and those used 
for fear conditioning and EEG recordings were habituated to the headstage/ 
wiring system in an environment different from both conditioning chamber 
and the cue memory test cage for 3 days (20 min per day) immediately before 
these tests. 

After finishing all experiments, mice were euthanized with an overdose of 
isoflurane. An anodal current (30 μA, 10 s) was passed through the electrode wire 
to verify the electrode placements. Frozen 30-μm coronal sections were cut and 
stained with cresyl violet. 

Behavioural tests. Tests of light/dark chamber and open field were conducted 
1 week before fear conditioning or water maze. Wire-hang, dowel-walk, rotarod 
and three-chamber tests, as well as a test of pain threshold, were performed 
1-2 weeks after fear conditioning or water maze (Extended Data Fig. 1). For each 
test, mice were given at least 30 min to habituate after transport to the behavioural 
testing room before any tests were conducted. A light intensity of 150 lx and 
background white noise at 60 dB were maintained during the habituation and 
throughout the testing periods. 

Fear conditioning. A delayed fear conditioning protocol was employed to evalu- 
ate hippocampus-dependent contextual fear memory and hippocampus-inde- 
pendent cue fear memory. On day 0 animals were trained in a mouse fear 
conditioning chamber with a grid floor that could deliver an electric shock 
(Med Associates, Inc.). This enclosure was located in a sound-attenuating box 
that contained a digital camera, a loudspeaker and a house light. Each mouse was 
initially placed in the chamber and left undisturbed for 2 min, after which a tone 
(30s, 5 kHz, 80 dB) coincided with a scrambled foot shock (2 s, 0.7 mA). The tone/ 
foot-shock stimuli were repeated after 1 min. After an additional 1 min, the mouse 
was removed and returned to its home cage. Fear memory retention was evaluated 
3h, 1 day, 3 days, and 7 days after training unless stated otherwise. At each time 
point, mice were first recorded for 3 min in the same chamber (cleaned with 70% 
ethanol) without tone. The mice were then tested in a novel cage (cleaned with 1% 
acetic acid) where a 3-min tone was presented after the animals had acclimated to 
the cage for 3 min. Mouse behaviour was recorded and scored automatically by 
ANY-maze (Stoelting). Freezing, defined as an absence of all movement except 
for respiration, was scored only if the animal was immobile for at least 1 s. The 
percentage of time spent freezing during the tests serves as an index of fear 
memory. Cued fear memory was calculated as the difference in freezing time between the 
tone phase and the no-tone phase. Data are shown as mean ± s.e.m. and analysed 
by two-way ANOVA followed by Tukey’s post hoc analysis. 
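
The freezing measure described above can be sketched as follows. This is an illustration only, not the ANY-maze algorithm: it assumes the tracker yields one immobile/mobile flag per video sample at a fixed rate, and it assumes cued fear memory is tone-phase freezing minus no-tone freezing. The function names are hypothetical.

```python
def freezing_percent(immobile, sample_hz, min_bout_s=1.0):
    """Percentage of samples in immobility bouts lasting >= min_bout_s.

    immobile  : sequence of booleans, one per sample (assumed input format)
    sample_hz : samples per second
    """
    samples = list(immobile)
    min_samples = int(round(min_bout_s * sample_hz))
    frozen = 0
    run = 0
    # A trailing False acts as a sentinel that closes the final bout.
    for flag in samples + [False]:
        if flag:
            run += 1
        else:
            if run >= min_samples:
                frozen += run
            run = 0
    return 100.0 * frozen / len(samples) if samples else 0.0


def cued_fear(tone_percent, no_tone_percent):
    """Cued fear index: freezing during tone minus freezing without tone."""
    return tone_percent - no_tone_percent
```

With a 10 Hz tracker, a 1.5 s bout counts as freezing while a 0.5 s pause does not, so brief stops do not inflate the index.
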
Morris water maze. The Morris water maze was used to assess spatial learning and 
memory in RTT mice and the effect of forniceal DBS. This assay was performed as 
previously described with a few modifications. The pool (120 cm in diameter) 
was filled with water (50 cm deep, 22-24 °C) made opaque with non-toxic white 
tempera paint. Visual cues were set on the wall of the testing room, at least 1 m 
from the pool edge. The ANY-maze tracking system (Stoelting) was used to track 
and analyse animal swimming. Mice were tested for their ability to find an escape 
platform (10cm in diameter) on three different components of the test: visible 
platform acquisition, hidden platform acquisition, and subsequent probe trial in 
the absence of the platform. In each case, the criterion for learning is an average 
latency of 15 s or less to locate the platform across a block of four consecutive trials 
(15s interval) per day. Mice are given up to 9 days to reach this criterion for 
learning. Twenty-four hours after the last training trial the mice were given a 
probe trial. During the probe trial, the platform was removed, and each animal 
was allowed 60s to search the pool. The amount of time and distance that each 
animal spent in each quadrant was recorded. The number of times a subject 
crossed the imaging location of the platform during training (platform crossing) 
was determined. Memory acquisition data are expressed as mean ± s.e.m. and 
analysed by two-way repeated-measures ANOVA followed by Tukey’s post 
hoc analysis. Probe test data are shown as individual values; a two-tailed t-test 
was used to compare the searching time in the target quadrants and the platform 
crossing numbers between groups, and a two-tailed paired t-test to compare the 
searching time in quadrants within groups. 
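
The learning criterion stated above (mean latency of 15 s or less across the daily block of four trials, within at most 9 days) can be expressed as a short check. This is a minimal sketch with a hypothetical function name and input layout (one list of trial latencies per training day), not the authors' analysis code.

```python
def first_day_meeting_criterion(latencies_by_day, criterion_s=15.0, max_days=9):
    """Return the 1-based training day on which the criterion is first met.

    latencies_by_day : list of per-day lists of escape latencies in seconds
    Returns None if the criterion is not met within max_days.
    """
    for day, trials in enumerate(latencies_by_day[:max_days], start=1):
        if trials and sum(trials) / len(trials) <= criterion_s:
            return day
    return None
```

For example, daily blocks with mean latencies of 52.5 s, 18.75 s and 15.0 s meet the criterion on day 3.
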

Open field. The open field apparatus consists of a clear, open Plexiglas box 
(40 × 40 × 30 cm, Stoelting) with overhead camera and photo beams to record 
horizontal and vertical movements. Activity was quantified over a 30-min period 
by ANY-maze (Stoelting). Data are shown as mean ± s.e.m. and analysed by 
two-way ANOVA followed by Tukey’s post hoc analysis. 

Light-dark box. The light-dark box assay was performed as published with a few 
modifications. The box consisted of a clear Plexiglas chamber (40 × 20 × 30 cm) 
with an open top separated from a covered black chamber (40 × 20 × 30 cm) by a 
black partition with a small opening (Stoelting). RTT and wild-type mice were 
placed into the illuminated side and allowed to explore freely for 10 min. An 
ANY-maze system with photo beam and overhead camera (Stoelting) was used to score 
the mice for the number and latency of entries and the time spent in each 
compartment. The mouse must place 50% of body length into either the light or dark 
compartment to be scored as an entry. Data are shown as mean ± s.e.m. and 
analysed by two-way ANOVA with Tukey’s post hoc analysis. 

Wire hang and dowel walk. These assays were performed as previously published 
with a few modifications. RTT and wild-type mice were tested for motor 
coordination. For the wire-hang test the mouse was held by the tail and allowed to grasp 
with its forepaws the middle of a single 3-mm plastic-coated wire suspended 
15 inches above a plastic-covered foam pad and released. For the dowel test the 
mouse was placed onto a 0.5-inch wooden dowel suspended 15 inches above a 
plastic-covered foam pad, and the total number of side touches and latency to fall 
were measured with a 120 s cut-off. Data are shown as mean ± s.e.m. and analysed 
using two-way ANOVA with Tukey’s post hoc analysis. 

Accelerating rotarod. This assay was performed as previously published with a 
few modifications**. RTT and wild-type mice were placed on the rotating cylinder 
of an accelerating rotarod apparatus (Ugo Basile) and allowed to move freely as the 
rotation increased from 5 r.p.m. to 40 r.p.m. over a 5-min period. Latency to fall 
was recorded when the mouse fell from the rod or when the mouse had ridden the 
rotating rod for two revolutions without regaining control. Data are shown as 
mean ± s.e.m. Latency to fall was analysed by two-way repeated-measures 
ANOVA with Tukey’s post hoc analysis. 

Three-chamber interaction. The test was performed as previously described*””. 
RTT and wild-type mice were used in this assay. For the habituation stage, test 
mice were placed in the middle chamber of the three-chamber apparatus (Ugo 
Basile) equipped with two empty, barred cages in the corners of the left and right 
chambers. They were allowed to explore freely for 10 min, with their movement 
tracked and recorded using the ANY-maze software pack (Stoelting), and inter- 
action time with each cage scored by an investigator blind to genotype and treat- 
ment group. For the social versus object stage, an age- and size-matched C57BL/6 
female mouse was placed in one cage and a Lego block of similar size was placed in 
the other cage. The test mouse was again placed in the middle chamber and 
allowed to explore freely for 10 min, with movement and interaction time recorded 
as before. Interaction time and time in each zone are shown as mean ± s.e.m. and 
analysed by two-way ANOVA with Tukey’s post hoc analysis. 

Pain threshold. The test was performed as previously published with a few mod- 
ifications*’. At the end of the test battery, animals were placed into the condition- 
ing chamber. Every 30 s, a 2-s scrambled electric foot shock with 0.05-mA 
increments (starting from 0 mA) was applied. The shock current thresholds of 
flinch, vocalization, and jumping were each recorded. Data are shown as mean ± 
s.e.m. and analysed by two-way ANOVA with Tukey’s post hoc analysis. 
Induction and recording of hippocampal synaptic plasticity in vivo. To deter- 
mine the effect of forniceal DBS on hippocampal synaptic plasticity, an additional 
concentric stimulating electrode was implanted ipsilaterally in the medial perfor- 
ant path (0.2 mm posterior and 2.8-3.0 mm lateral of lambda, 1.0-1.3 mm below 
the dura). The stimulating and recording electrodes were surgically implanted as 
previously described*'” with the following modifications. The final depth of the 
electrodes was determined by electrophysiological guidance. A cortical silver ball, 
placed contralaterally, served as a recording reference as well as ground. Dental 
cement was used to anchor the electrode assembly that is connected to a unity gain 
preamplifier, and the connecting device for chronic recordings. After recovery 
from surgical implantation, mice were transported and habituated to the recording 
system during each of the 4 days before starting the LTP test. Signals were amp- 
lified (100×), filtered (bandpass, 0.1–5 kHz), digitized at 10 kHz, and stored on 
disk for off-line analysis (pClamp10 and 1440A; Molecular Devices). To evaluate 
whether forniceal DBS influences the input–output (I/O) relation in the perforant 
path–dentate pathway, I/O curves were generated for each mouse 1 day before and 
3 weeks after the DBS/sham treatment. For LTP evaluation, test responses elicited 
by 0.033 Hz monophasic pulses (0.1 ms duration) were recorded for 20-min per- 
iods on consecutive days at an intensity that evoked 40% of the maximal popu- 
lation spike. Following 2 days of stable baseline, a tetanus was delivered to the 
perforant path for LTP induction. Pulse width was doubled during tetani, which 
consisted of six series of six trains of six stimuli at 400 Hz, 200 ms between trains, 
20 s between series. Responses were measured for 60 min after tetanus and again 
for 20 min at 24 h, 48 h and 120 h after tetanus. Since the latency of the population 
spike usually decreases following LTP induction, it is impractical to compare the 
initial slope of the fEPSP (field excitatory postsynaptic potential) before and after 
LTP induction in awake animals’. Accordingly, we quantified the amplitude of 
the population spikes’. Data were averaged every 5 min and normalized to the 
baseline measured over the 10 min before tetanic stimulation and presented as 
mean ± s.e.m. Two-way repeated-measures ANOVA was used for data analysis. 
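
The binning and normalization step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' analysis code: the function name `bin_and_normalize`, the time convention (minutes relative to the tetanus, negative before it) and the choice of fixed 5-min bins are all hypothetical.

```python
from statistics import mean


def bin_and_normalize(times_min, amplitudes, bin_min=5.0, baseline_min=10.0):
    """Average responses in fixed bins and express them as % of baseline.

    times_min  : response times in minutes relative to the tetanus
                 (negative values are before the tetanus) -- an assumed convention.
    amplitudes : population spike amplitudes in any consistent unit.
    """
    # Baseline = mean of all responses in the window [-baseline_min, 0).
    baseline = [a for t, a in zip(times_min, amplitudes) if -baseline_min <= t < 0]
    if not baseline:
        raise ValueError("no responses fall inside the baseline window")
    base = mean(baseline)

    # Group responses by the start time of their bin; floor division
    # also places negative times in the correct bin.
    bins = {}
    for t, a in zip(times_min, amplitudes):
        start = (t // bin_min) * bin_min
        bins.setdefault(start, []).append(a)

    # One (bin start, % of baseline) pair per bin, in time order.
    return [(start, 100.0 * mean(vals) / base) for start, vals in sorted(bins.items())]


# Test pulses at 0.033 Hz arrive about every 30 s, i.e. every 0.5 min.
times = [-10 + 0.5 * i for i in range(40)]
amps = [2.0 if t < 0 else 3.0 for t in times]
print(bin_and_normalize(times, amps))
```

With one test pulse every 30 s, each 5-min bin averages ten responses, and a value of 100 corresponds to the pre-tetanus baseline.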
Recordings of hippocampal local field potentials (LFPs) and data analysis. The 
recording electrode of LFPs was targeted to the upper molecular layer of the 
dentate gyrus with the reference electrode in the corpus callosum. Recording of 
LFPs was conducted under matched behavioural states for RTT and wild-type 
mice. Signals were amplified (100×), filtered (bandpass, 0.1–5 kHz), digitized at 
2 kHz, and stored on disk for off-line analysis (pClamp10 and 1440A; Molecular 
Devices). The power spectrum of the LFPs was calculated at 0.244 Hz resolution, 
using the built-in function of pClamp 10. Then the relative power of hippocampal 
theta activity was normalized as the ratio between the power of the theta signal at 
4.15-11.96 Hz and the power of the signal at 0-100 Hz”. LFP recordings were 
excluded from further analysis if mice made large movements during recordings 
or electrical artefacts were evident’. 
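
The relative theta power defined above is a ratio of band powers, which can be sketched as below. This is an illustration only: the authors used the built-in pClamp function, whereas `relative_band_power` is a hypothetical helper that takes an already-computed power spectrum. A 0.244 Hz resolution at 2 kHz sampling would be consistent with an 8,192-point window (2,000/8,192 ≈ 0.244), but the window length is an assumption, not something the text states.

```python
def relative_band_power(freqs_hz, power, band=(4.15, 11.96), reference=(0.0, 100.0)):
    """Ratio of summed spectral power in `band` to summed power in `reference`.

    freqs_hz : frequency of each spectral bin (Hz).
    power    : power in each bin (same length as freqs_hz).
    Band edges are treated as inclusive; real spectra may need a small
    tolerance if the reported edges were rounded from bin centres.
    """
    if len(freqs_hz) != len(power):
        raise ValueError("freqs_hz and power must have the same length")

    def band_sum(lo, hi):
        # Sum the power of every bin whose frequency lies in [lo, hi].
        return sum(p for f, p in zip(freqs_hz, power) if lo <= f <= hi)

    total = band_sum(*reference)
    if total == 0:
        raise ValueError("reference band contains no power")
    return band_sum(*band) / total


# Toy spectrum at 1 Hz resolution with equal power in every bin:
# bins 5..11 Hz fall in the theta band, bins 0..100 Hz in the reference.
freqs = list(range(0, 201))
flat = [1] * len(freqs)
print(relative_band_power(freqs, flat))
```

A flat spectrum gives a small ratio set only by the band widths; a spectrum dominated by theta gives a ratio approaching 1.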
Intracranial drug infusion and histology. Under isoflurane anaesthesia, custom- 
made 26G guide cannulas were implanted bilaterally (1.8 mm posterior, 1.2 mm 
lateral, and 1.2 mm below the bregma) in RTT mice and littermate wild-type 
controls. In addition to the DBS electrode, a recording electrode (Teflon-coated 
tungsten wire, 50 µm bare diameter) was unilaterally attached to the guide cannula 
for the recordings of evoked potential during surgical implantation and LFPs in 
the dentate. After 2 weeks of recovery, mice were randomly assigned into drug- or 
vehicle-treated groups. Atropine sulphate (Sigma) was dissolved in PBS (pH 7.4) 
and the solution was back-filled into a 33G injector. The solution of 0.5 µl drug 
(1.0 µg atropine)***’ or PBS vehicle per side was microinfused over 1 min into the 
dorsal hippocampus (dHP) through a pump (Harvard Apparatus)”. The tip of the 
injector was 1.0 mm below the guide tip and ~0.6 mm from the tip of the record- 
ing electrode. Following each injection, the injector was left in place for an addi- 
tional minute to allow drug diffusion. To determine the effect of atropine on 
hippocampal potentials evoked by FFx stimulation, atropine was infused into 
the dHP after 20 min of baseline test (0.033 Hz) and the responses were followed 
up for another 30 min in freely moving mice. To determine the effect of atropine 
on hippocampal theta activity, dentate LFPs were recorded for 5 min immediately 
before and another 5 min after atropine/vehicle infusion (10-15 min after the 
infusion). To evaluate the influence of atropine on DBS effect, RTT and wild-type 
mice receiving forniceal DBS were infused with atropine or vehicle 15 ± 2 min 
before fear conditioning training. Then fear memory was tested 24 h after training. 
At the end of the experiment, 4% methylene blue in PBS (0.2 µl) was injected into 
each injection site. Mice were euthanized with an overdose of isoflurane. An 
anodal current (30 µA, 10 s) was passed through the electrode wire to verify 
electrode placements. Frozen 30-µm coronal sections were cut and stained with 
haematoxylin. Mice with blocked guide cannula(s) or with injection site(s) outside 
of the dorsal hippocampus were excluded from data analysis. 


Immunohistochemistry. To check whether forniceal DBS increased the activity 
of dentate neurons, we assessed expression of the Fos gene 60 min after 2 h of DBS 
or sham treatment. To evaluate whether DBS increases neurogenesis in the dentate 
gyrus, mice were injected with 5-bromo-2’-deoxyuridine (BrdU, Sigma-Aldrich, 
75 mg kg⁻¹, i.p.) immediately after daily DBS or sham treatment for 12 days from 
the third day of the 2 weeks of DBS/sham treatment. 

Immunohistochemical staining. Mice were euthanized with an overdose of 
isoflurane and subsequently perfused with 50 ml 0.1 M phosphate-buffered saline 
(PBS, pH 7.4) followed by 60 ml cold 4% paraformaldehyde in 0.1 M PBS. Brains 
were removed, post-fixed with the same fixative for 12 h, and then transferred to 
30% sucrose at 4 °C. One of every six of the floating serial 50-µm coronal sections 
throughout the hippocampus was processed for c-Fos, DCX, and BrdU immuno- 
histochemistry. After three rinses with 0.1 M PBS, the sections were incubated in 
blocking solution (0.3% Triton X-100, 5% normal goat serum in 0.1 M PBS) for 1 h 
at room temperature followed by 48 h incubation with primary antibody at 4 °C. 
Primary antibodies and their final concentrations were as follows: rabbit polyclo- 
nal anti-c-Fos (1:10,000, PC38, Millipore), rabbit anti-DCX (1:400, Cell Signaling 
Technology) and rat monoclonal anti-BrdU (1:140, OBT0030, Accurate Chemical 
& Scientific Corporation). Sections were incubated with Alexa Fluor 568 goat anti- 
rabbit IgG (1:500, A11036, Invitrogen) and/or Alexa Fluor 488 goat anti-rat IgG 
(1:500, A11006, Invitrogen) for 2 h and counterstained with DAPI (900 nM, 
D1036, Invitrogen) for 15 min at room temperature in darkness. The sections 
were washed three times with 0.1 M PBS and mounted using Fluoromount-G 
(Southern Biotech). For c-Fos staining, sections were pre-treated with 0.3% 
H₂O₂ in PBS for 30 min at room temperature. For BrdU detection, sections were 
pre-treated with 2 M HCl for 30 min at 37 °C and washed in 0.1 M borate buffer, 
pH 8.4, for 10 min. 

Imaging and quantification. Z stacks of 2-µm-thick single-plane images at 20× 
magnification were collected through the entire thickness of the slice by employing 
a laser scanning microscope LSM 710 (Carl Zeiss). Each slice has 12-13 z-axis 
optical slices; the sixth optical slice was selected for counting c-Fos-, BrdU- and 
DCX-positive cells. Digital images were routed into a PC for quantitative analyses 
using ImageJ software (NIH). Double labelling for BrdU- and DCX-positive cells 
was assessed through the entire z axis of each cell. For quantification, six sections 
per mouse brain were counted. Resulting cell numbers were multiplied by 6 to 
obtain the estimated total number of positive cells per dentate gyrus. 
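
The scaling from counted sections to a per-dentate estimate is simple arithmetic, sketched below. The function name and the example counts are hypothetical; the only values taken from the text are the one-in-six section sampling and the multiplication by 6.

```python
def estimate_total_cells(counts_per_section, sampling_interval=6):
    """Estimate total positive cells from counts in systematically sampled sections.

    counts_per_section : positive-cell count in each counted section.
    sampling_interval  : one of every `sampling_interval` serial sections was
                         processed, so the summed count is scaled by this factor.
    """
    if sampling_interval < 1:
        raise ValueError("sampling_interval must be at least 1")
    return sum(counts_per_section) * sampling_interval


# Six counted sections from one mouse (made-up counts, for illustration only).
print(estimate_total_cells([10, 12, 9, 11, 8, 10]))
```

The estimate assumes that the sampled sections are representative of the unsampled ones between them.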

Statistical analyses. Sample sizes of mice were determined based on prior statistics 
and data phenotyping the RTT mice’’. Animals with disconnected electrode 
implants before the completion of the experiments were excluded from data 
analyses. Data were analysed using two-way repeated-measures ANOVA. If any 
of the main effects were significant, Tukey’s post hoc analysis was used for all 
pairwise multiple comparisons unless otherwise specified. In all cases, P < 0.05 
was set as the cut-off for statistical significance. SigmaPlot 12 was used to create all 
the summary plots and to perform all the statistical tests in this study. 

Extended Data Figure 1 | Timeline of forniceal DBS tests in RTT and wild-type mice. 

Extended Data Figure 2 | Fear memory in RTT mice and wild-type control 
animals. All mice were trained with tone-foot-shock pairings on day 0. 
Memory retention was tested 3 h, 1 day, 3 days and 7 days after training. 

a, b, Impaired fear memory in RTT mice (n = 20) compared to wild-type (WT) 
littermates (n = 20). These animals were implanted with electrodes but did not 
receive DBS or sham treatment. A significant main effect of genotype was 
observed (two-way repeated-measures ANOVA followed by Tukey’s post hoc 
test: context, F1,38 = 15.32, P < 0.001; cue, F1,38 = 20.70, P < 0.001). *P < 0.05; 
**P < 0.01; ***P < 0.001 versus wild type. c, d, Cued fear memory in RTT 
mice (n = 20) and wild-type littermates (n = 20) that were implanted with 
electrodes but without DBS or sham treatment. During the retention test, 
freezing in the tone phase (T) was significantly greater than in the no-tone phase 
(NT) across all the test time points in both wild-type (c) and RTT mice (d). 
e-h, Retrieval of cue fear memory in DBS- or sham-treated RTT and wild-type 
mice. During the cued memory test, all four groups of animals actively 
responded to the tone presentation (WT-sham, n = 21; WT-DBS, n = 21; 
RTT-sham, n = 14; RTT-DBS, n = 17). There was a significant increase of 
freezing time in the tone phase (T) compared to the no-tone phase (NT) at each 
of the test time points over all the groups. *P < 0.05, **P < 0.01, ***P < 0.001 
(two-tailed paired t-test). All data are presented as mean ± s.e.m. 

Extended Data Figure 3 | Increased handling, but not forniceal DBS, 
increased locomotor activity and decreased the anxiety level in RTT and 
wild-type mice. a, There was no difference among the four DBS/sham-treated 
groups in the total distance travelled in the open-field test (WT-sham, n = 20; 
WT-DBS, n = 20; RTT-sham, n = 17; RTT-DBS, n = 18; genotype, 
F1,71 = 1.13, P = 0.292; treatment, F1,71 = 0.13, P = 0.724; genotype × 
treatment, F1,71 = 0.063, P = 0.803). RTT and wild-type mice that received 
DBS/sham treatment travelled longer distances than RTT (n = 20) and wild- 
type (n = 20) animals that were implanted with electrodes but did not 
experience DBS/sham procedures, respectively. b, During the open-field test, 
there was no difference in the centre:total distance ratio among the four DBS 
groups (genotype, F1,71 = 1.22, P = 0.273; treatment, F1,71 = 0.0079, P = 0.93; 
genotype × treatment, F1,71 = 0.081, P = 0.777). Both RTT and wild-type mice 
that received DBS/sham treatment travelled more in the centre area compared 
to implanted RTT and wild-type animals that did not receive DBS/sham 
procedures. c, In the light/dark test there was no difference in the amount of 
time spent in the light compartment among the four chronically treated groups 
(n = 12 per group; two-way ANOVA: genotype, F1,44 = 1.83, P = 0.183; 
treatment, F1,44 = 0.057, P = 0.813; genotype × treatment, F1,44 = 0.33, 
P = 0.567). Both RTT and wild-type mice that received DBS/sham treatment 
spent more time in the light compartment than implanted RTT (n = 15) 
and wild-type (n = 14) animals that did not receive DBS/sham procedures. 
*P < 0.05, **P < 0.01, ***P < 0.001 (two-tailed t-test). All data are presented 
as mean ± s.e.m. 


[Extended Data Figure 4 panels: a, Pain threshold (flinch, vocalization, jump); b, Rotarod; c, Wire hang; e, f, Social interaction (P = 0.063). Groups in all panels: WT-sham, WT-DBS, RTT-sham, RTT-DBS.] 

Extended Data Figure 4 | Forniceal DBS did not alter the pain threshold, 
motor function or social behaviour in RTT or wild-type mice. a, There was 
no group difference in foot shock threshold intensities to evoke flinch, 
vocalization or jumping (WT-sham, n = 14; WT-DBS, n = 14; RTT-sham, 
n = 11; RTT-DBS, n = 12; two-way ANOVA, no significant main effect of 
genotype, treatment, or genotype × treatment interaction, P > 0.05). b, In a 
rotarod test (n = 12 mice per group), latency to fall increased over trials but 
there was no difference among the four groups (two-way repeated-measures 
ANOVA: group, F3,44 = 1.68, P = 0.184; trial, F7,308 = 34.26, P < 0.001; 
group × trial interaction, F21,308 = 1.22, P = 0.230). c, RTT mice showed 
decreased latency to fall in the wire-hang test compared to wild-type animals, 
but there was no difference between DBS- and sham-treated groups for 
either RTT or wild-type mice (n = 12 per group; two-way ANOVA: 
genotype, F1,44 = 10.41, P = 0.002; treatment, F1,44 = 0.33, P = 0.566; 
genotype × treatment interaction, F1,44 = 0.75, P = 0.392). d, RTT mice 
showed a decreased latency to fall in the dowel test compared to wild-type 
animals, but there was no difference between DBS- and sham-treated groups 
for either genotype (n = 12 per group; genotype, F1,44 = 23.63, P < 0.001; 
treatment, F1,44 = 0.0018, P = 0.966; genotype × treatment interaction, 
F1,44 = 0.83, P = 0.367). e, f, In the three-chamber test, all four groups of 
animals (n = 12 per group) showed a clear preference for the partner mice 
compared to the object (e). Two-way ANOVA revealed a significant genotype 
main effect on the interaction time with the partner mice (F1,44 = 4.56, 
P = 0.038), indicating altered social behaviour in RTT mice (P = 0.063, RTT- 
sham versus WT-sham, Tukey’s post hoc). However, DBS did not change the 
interaction time with the partners (treatment, F1,44 = 0.28, P = 0.597; 
genotype × treatment interaction, F1,44 = 0.31, P = 0.579) or the object 
(treatment, F1,44 = 2.64, P = 0.111; genotype × treatment interaction, 
F1,44 = 0.015, P = 0.905) (f). **P < 0.01, ***P < 0.001 (Tukey’s post hoc in 
c, d; two-tailed paired t-test in e). All data are presented as mean ± s.e.m. 

Extended Data Figure 5 | Forniceal DBS did not alter the body weight, 
visual or sensorimotor skills in RTT or wild-type mice. a, All four groups 
(n = 12 mice per group) showed changes in body weight over time. Two-way 
repeated-measures ANOVA revealed a significant main effect of group 
(F3,44 = 6.73, P < 0.001) and age (F4,176 = 89.32, P < 0.001). Tukey’s post hoc 
showed that sham-treated RTT mice were significantly heavier than sham- 
treated wild-type mice (P = 0.015), but there was no difference in body 
weight between sham-treated and DBS-treated wild-type mice (P = 0.861) or 
between sham-treated and DBS-treated RTT mice (P = 0.099). b, Comparison 
of body weight at the age of 23 weeks among the four groups (two-way 
ANOVA: genotype, F1,44 = 10.06, P = 0.003; treatment, F1,44 = 1.93, 
P = 0.172). c-e, Swimming test in the water maze task with a flagged platform 
(n = 18 mice per group). Sham-treated RTT mice did not have different 
escape latencies than sham-treated wild-type controls (c, two-way repeated- 
measures ANOVA: genotype, F1,34 = 1.73, P = 0.197; genotype × treatment 
interaction, F1,34 = 0.133, P = 0.718). DBS did not change the escape 
latencies in either wild-type controls (d; treatment, F1,34 = 0.44, P = 0.513; 
treatment × day interaction, F1,34 = 1.24, P = 0.273) or RTT mice 
(e, treatment, F1,34 = 2.36, P = 0.134; treatment × day interaction, F1,34 = 0.41, 
P = 0.524). *P < 0.05; n.s., not significant (Tukey’s post hoc). All data are 
presented as mean ± s.e.m. 

Extended Data Figure 6 | Effect of forniceal DBS on hippocampal 
electrophysiological signatures. a, Representative traces of LFPs recorded in 
the dentate gyrus 1 day before and 3 weeks after DBS/sham treatment. There 
were no electrographic seizure spikes in any of the four groups of mice after 
DBS/sham treatment. Scale bars: 10 s, 1 mV. b, Input–output (I/O) curves of 
the evoked responses of the perforant path recorded in the dentate gyrus in 
DBS/sham-treated mice. For each of the four groups, I/O curves were generated 
1 day before and 3 weeks after forniceal DBS. All data points were normalized 
to the maximum value of the population spike amplitude before DBS/sham and 
the abscissa represents the seven increments used in each mouse. The I/O 
relationship was not altered by DBS in sham-treated wild-type mice 
(WT-sham; n = 5, F1,4 = 0.062, P = 0.818), DBS-treated wild-type mice 
(WT-DBS; n = 4, F1,3 = 0.036, P = 0.861), or sham-treated RTT mice 
(RTT-sham; n = 5, F1,4 = 0.018, P = 0.901). DBS reduced the amplitude of the 
evoked population spikes from the baseline test in DBS-treated RTT mice 
(RTT-DBS; n = 5, F1,4 = 6.73, P = 0.060). *P < 0.05 (Tukey’s post hoc). All 
data are presented as mean ± s.e.m. 

Extended Data Figure 7 | Unilateral forniceal DBS induces neuronal 
activity and stimulates neurogenesis bilaterally in the dentate gyrus. 
a, Representative images showing that expression of the Fos gene was increased 
following forniceal DBS in wild-type and RTT mice compared to their sham 
controls, respectively (percentage of ipsilateral c-Fos-positive cells over the 
dentate granule cells: WT-sham, 0.26 ± 0.04%; WT-DBS, 34.52 ± 4.62%; 
RTT-sham, 0.30 ± 0.05%; RTT-DBS, 32.55 ± 3.74%). b, Representative images 
showing that there were more BrdU⁺ (green), DCX⁺ (red), and merged 
(yellow) cells in the dentate gyrus in forniceal DBS-treated wild-type and RTT 
mice than in their respective sham controls. Scale bar, 100 µm. Con, 
contralateral; Ips, ipsilateral. 

Extended Data Figure 8 | The cholinergic antagonist atropine did not 
alter forniceal DBS-induced enhancement of fear memory. a, Placement 
of guide cannula and recording electrode into the dorsal hippocampus. 
b, Hippocampal infusion of 1.0 µg atropine did not change the amplitudes of 
the evoked potentials of the FFx recorded in the dentate gyrus in either RTT or 
wild-type mice. There was no difference in the population spike amplitudes 
before and after atropine infusion in either RTT mice (n = 5; one-way ANOVA, 
F9,36 = 0.69, P = 0.715) or wild-type controls (n = 3; F9,18 = 0.99, P = 0.485). 
c, Representative hippocampal EEG traces before and after vehicle (V) or 
atropine (A) infusion. Scale bars: 0.5 s, 0.2 mV. d, RTT mice (n = 17) showed 
less spontaneous hippocampal theta activity than wild-type animals (n = 20) 
(**P < 0.01, two-tailed t-test). e, Hippocampal infusion of atropine, but not 
vehicle, reduced hippocampal theta oscillation in both RTT and wild-type 
mice compared to their pre-infusion baselines (WT-V, n = 9; WT-A, n = 11; 
RTT-V, n = 8; RTT-A, n = 9; *P < 0.05, two-tailed paired t-test; n.s., not 
significant). f, Hippocampal microinfusion of atropine before fear conditioning 
training did not alter fear memory in forniceal DBS-treated RTT mice or 
wild-type controls. Mice in all four groups (WT-V, n = 10; WT-A, n = 11; 
RTT-V, n = 12; RTT-A, n = 13) experienced 2 weeks of forniceal DBS that was 
finished 3 weeks before fear conditioning training. Atropine or vehicle was 
bilaterally infused into the dorsal hippocampus before training. Memory 
retention was tested 24 h after training. Two-way ANOVA revealed a 
significant main effect of genotype (F1,42 = 10.27, P = 0.003), but there 
was no difference between atropine- and vehicle-treated mice (treatment, 
F1,42 = 0.34, P = 0.562; genotype × treatment interaction, F1,42 = 0.069, 
P = 0.794). Atropine did not change cued fear memory, either: two-way 
ANOVA revealed no difference between genotypes (F1,42 = 2.99, P = 0.091) or 
between atropine- and vehicle-treated mice (treatment, F1,42 = 0.046, 
P = 0.831; genotype × treatment interaction, F1,42 = 0.154, P = 0.697). 
*P < 0.05; n.s., not significant (Tukey’s post hoc). g, Intra-hippocampal 
atropine infusion alone did not change the basal level of freezing in the 
contextual test environment in either wild-type or RTT mice. There was no 
difference between vehicle- (n = 9) or atropine-treated (n = 6) mice (P > 0.05, 
two-tailed t-test). h, Schematic representation of the dorsal hippocampus at 
seven rostral-caudal planes (according to ref. 31) for the microinfusion sites in 
DBS-treatment experiments. The numbers on the left represent the posterior 
coordinate from the bregma. All data are presented as mean ± s.e.m. 


Control of REM sleep by ventral medulla GABAergic neurons 


Franz Weber1, Shinjae Chung1, Kevin T. Beier2, Min Xu1, Liqun Luo2 & Yang Dan1 


Rapid eye movement (REM) sleep is a distinct brain state charac- 
terized by activated electroencephalogram and complete skeletal 
muscle paralysis, and is associated with vivid dreams’ *. Tran- 
section studies by Jouvet first demonstrated that the brainstem is 
both necessary and sufficient for REM sleep generation’, and the 
neural circuits in the pons have since been studied extensively**. The 
medulla also contains neurons that are active during REM sleep’”"’, 
but whether they play a causal role in REM sleep generation remains 
unclear. Here we show that a GABAergic (γ-aminobutyric-acid- 
releasing) pathway originating from the ventral medulla powerfully 
promotes REM sleep in mice. Optogenetic activation of ventral 
medulla GABAergic neurons rapidly and reliably initiated REM 
sleep episodes and prolonged their durations, whereas inactivating 
these neurons had the opposite effects. Optrode recordings from 
channelrhodopsin-2-tagged ventral medulla GABAergic neurons 
showed that they were most active during REM sleep (REMmax), 
and during wakefulness they were preferentially active during eating 
and grooming. Furthermore, dual retrograde tracing showed that 
the rostral projections to the pons and midbrain and caudal projec- 
tions to the spinal cord originate from separate ventral medulla 
neuron populations. Activating the rostral GABAergic projections 
was sufficient for both the induction and maintenance of REM 
sleep, which are probably mediated in part by inhibition of REM- 
suppressing GABAergic neurons in the ventrolateral periaqueductal 
grey. These results identify a key component of the pontomedullary 
network controlling REM sleep. The capability to induce REM 
sleep on command may offer a powerful tool for investigating its 
functions. 

Previous studies showed that the ventral medulla (vM) contains 
GABAergic neurons expressing the immediate early gene c-fos after 
deprivation-induced REM sleep rebound'’”’, suggesting REM sleep- 
related activity. To test the causal relationship between vM GABAergic 
neuron activity and brain states, we injected Cre-inducible adeno- 
associated viruses (AAV) expressing channelrhodopsin 2 fused with 
enhanced yellow fluorescent protein (ChR2-eYFP) locally into the vM 
of GAD2-Cre mice (Fig. 1a). Laser stimulation (20 Hz, 120 s per trial) 
was applied every 15-25 min, and brain states—wake, REM, and non- 
REM (NREM) sleep—were classified on the basis of electroencephalo- 
gram (EEG) and electromyogram (EMG) recordings. REM sleep was 
observed at a high probability during laser stimulation (Fig. 1b and 
Supplementary Video 1). To quantify the effect, we aligned all trials 
from six mice by the time of laser stimulation (Fig. 1c). We found a 
rapid, ~12-fold increase in REM sleep within 30 s of laser onset 
(P < 0.001, bootstrap) and a complementary decrease of NREM sleep. 
The EEG power spectrum and EMG activity during the laser-induced 
REM state were indistinguishable from those during spontaneous 
REM sleep outside laser stimulation periods (Extended Data Fig. 1). 
In control mice expressing eYFP without ChR2, laser had no effect 
(Extended Data Fig. 2), and the laser-induced change in the probability 
of REM sleep was significantly different between the ChR2 and control 
mice (P < 0.001, bootstrap). Furthermore, ChR2-mediated activation 
of vM glutamatergic neurons reliably induced wakefulness rather than 
REM sleep (Extended Data Fig. 3), indicating that the REM-promoting 
effect was specific to GABAergic neurons. 
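
The trial-aligned analysis described above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes that brain states have already been scored into equal-length time bins aligned to laser onset, the labels 'W', 'N' and 'R' and both function names are hypothetical, and resampling whole trials with replacement is only one possible bootstrap scheme (the text does not state which unit was resampled).

```python
import random


def state_probability(trials, state):
    """Fraction of trials that are in `state` at each aligned time bin."""
    n_bins = len(trials[0])
    if any(len(t) != n_bins for t in trials):
        raise ValueError("all trials must cover the same aligned time bins")
    return [sum(t[i] == state for t in trials) / len(trials) for i in range(n_bins)]


def bootstrap_ci(trials, state, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the state probability in each bin."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(n_boot):
        # Resample whole trials with replacement (an assumed scheme).
        resampled = [rng.choice(trials) for _ in trials]
        samples.append(state_probability(resampled, state))

    lo_idx = int((alpha / 2) * (n_boot - 1))
    hi_idx = int((1 - alpha / 2) * (n_boot - 1))
    lower, upper = [], []
    for i in range(len(trials[0])):
        column = sorted(s[i] for s in samples)
        lower.append(column[lo_idx])
        upper.append(column[hi_idx])
    return lower, upper


# Four toy trials of three time bins each (W = wake, N = NREM, R = REM).
trials = [["N", "R", "R"], ["N", "N", "R"], ["W", "R", "W"], ["N", "R", "R"]]
print(state_probability(trials, "R"))
print(bootstrap_ci(trials, "R", n_boot=200))
```

The lower and upper lists play the role of the shaded 95% interval around each probability trace.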

[Figure 1 panels: EEG power spectrogram (power, µV² s) with brain states (REM, wake, NREM); brain state in each trial and state probability against time (s), including trials with NREM before laser (n = 97) and trials with wakefulness before laser (n = 46).] 

Figure 1 | Optogenetic activation of vM GABAergic neurons induces 
REM sleep. a, Top, schematic of optogenetic experiment (adapted with per- 
mission from The Mouse Brain in Stereotaxic Coordinates, 3rd edn, Franklin, 
K.B.J. and Paxinos, G., 88, Copyright Elsevier (Academic Press, 2007)*"). 
Bottom, fluorescence image of vM (dashed box in schematic) in a GAD2-Cre 
mouse injected with AAV expressing ChR2-eYFP (green). Blue, 4′,6-diamidino- 
2-phenylindole (DAPI). GI, gigantocellular reticular nucleus; LPGI, lateral 
paragigantocellular nucleus; IO, inferior olive; Amb, ambiguus nucleus. 
ChR2-eYFP was expressed within ~400 µm from injection site and mainly 
localized within the LPGI. b, Example experiment. Shown are EEG power 
spectrogram (Freq., frequency), EMG amplitude (Ampl.), and brain states 
(colour coded). Blue shading, laser stimulation period (20 Hz, 120 s). c, Brain 
states in all trials from six mice (top) and probability of wake, NREM, or REM 
states (bottom) before, during, and after laser stimulation. Shading, 95% CI. 
Blue bar, laser stimulation period. d, Similar to c, for trials with laser onset 
falling on NREM sleep (probability of NREM is 1 immediately before laser 
onset). e, Trials with laser onset falling on wakefulness. 

The complementary changes in the probabilities of REM and 
NREM (Fig. 1c) suggest that vM GABAergic neuron activation prim- 
arily triggered NREM to REM transitions. To test this possibility, we 
analysed the effect of laser on the transition probability between each 
pair of brain states. Laser stimulation markedly enhanced the NREM 
to REM transitions (P = 0.02, bootstrap; Extended Data Fig. 4a). As a 
result, for trials in which laser onset fell on NREM sleep, the probability 
of REM sleep increased to 85% within 50 s (Fig. 1d), and NREM to 
REM transitions occurred in 94% of these trials. Notably, laser stimu- 
lation caused no significant change in NREM to wake (P = 0.24) and a 
reduction in the probability of REM to wake (P < 0.001) transitions 
(Extended Data Fig. 4b, c). Thus the gradual increase of wakefulness 
during laser stimulation (Fig. 1c, purple) was not due to a direct induc- 
tion of wakefulness, but was a secondary consequence of laser-induced 
increase in REM sleep (in rodents, periods of REM sleep are often 
shorter than 120s and typically followed by wakefulness). Further- 
more, direct wake to REM transitions, characteristic of narcolepsy, 
were not observed (Extended Data Fig. 4e), indicating that optogenetic 
activation of vM GABAergic neurons triggered normal but not patho- 
logical transitions into REM sleep. This also explains why in trials in 
which the laser was turned on during wakefulness, no increase in REM 
sleep was observed (Fig. 1e and trial 3 in Supplementary Video 1). 

To test whether these neurons also contribute to the maintenance of 
REM sleep, we applied a closed-loop protocol", in which laser stimu- 
lation was initiated after spontaneous onset of REM sleep and main- 
tained throughout the REM episode (Fig. 2a). The stimulation was 
applied randomly in ~50% of the episodes, and REM duration was 
compared between the episodes with and without stimulation (Fig. 2b). 
Laser stimulation increased the mean duration of REM episodes by 
111% in GAD2-ChR2 mice (laser, 148.0 + 19.2s (mean + s.d.); no 
laser, 70.2 + 10.18; P = 0.00037, paired t-test, n = 6 mice) but not in 
eYFP control mice (laser, 77.0 + 14.9 s; no laser, 83.8 + 13.6 s; P = 0.62, 
n = 4; Fig. 2c). Conversely, archaerhodopsin (Arch)- or halorhodopsin 
(Halo)-mediated silencing of vM GABAergic neurons (green laser) 
caused a 41% reduction of REM duration (laser, 54.1 ± 9.8 s; no laser,
91.2 ± 11.7 s; P = 0.001, n = 4), an effect not observed in eYFP control
mice (laser, 90.0 ± 12.9 s; no laser, 79.5 ± 9.6 s; P = 0.075, n = 4;
Fig. 2d). Thus, vM GABAergic neuron activity also contributes strongly 
to REM sleep maintenance. In addition, we pharmacogenetically 
silenced vM GABAergic neurons by expressing hM4Di'°*. Compared
with vehicle injection, inhibition of these neurons by clozapine-N-oxide 
(CNO) injection strongly reduced REM sleep in a dose-dependent 
manner (Extended Data Fig. 5), further indicating the importance of 
vM GABAergic neurons in REM sleep generation. 
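
As a consistency check, the percentage changes in REM episode duration quoted for the closed-loop experiments can be recovered from the reported group means. This assumes the percentages were computed on the group means rather than per mouse; the symbol for the mean duration is introduced here only for illustration.

```latex
\[
\frac{\bar{d}_{\mathrm{laser}} - \bar{d}_{\mathrm{no\,laser}}}{\bar{d}_{\mathrm{no\,laser}}}
  = \frac{148.0 - 70.2}{70.2} \approx 1.11 \quad (\text{ChR2: } +111\%),
\qquad
\frac{54.1 - 91.2}{91.2} \approx -0.41 \quad (\text{Arch/Halo: } -41\%)
\]
```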

Although vM GABAergic neurons have been shown to express c-fos 
after deprivation-induced REM sleep rebound’, the slow time 
course of c-fos expression and its modulation by non-activity-related 
factors'® limit its precision in identifying REM-active neurons. To 
Figure 2 | Activation of vM GABAergic neurons prolongs REM sleep
duration. a, Schematic of closed-loop stimulation. Laser was turned on after 
spontaneous REM onset and turned off at termination of the REM episode. 
b, Example recording containing REM episodes with and without laser 
stimulation (blue shading). c, Mean REM sleep duration with and without 
laser stimulation, in ChR2 (left, n = 6) and eYFP control (right, n = 4) mice. 
Each pair of dots, data from one mouse. Error bar, s.d. d, Similar to c, but with 
green laser stimulation in Arch/Halo (left, n = 4) and eYFP control (right, 
n = 4) mice.
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understand how vM GABAergic neurons regulate REM sleep under 
natural conditions, we recorded their spiking activity across brain 
states. Since the vM contains multiple spatially intermingled cell 
types'’, we tagged the GABAergic neurons with ChR2 by crossing 
GAD2-Cre with ChR2 reporter (Ai32) mice. Recordings were made
in freely moving mice by using optrodes, consisting of an optical fibre 
surrounded by multiple microelectrodes. High-frequency laser pulses 
(15 or 30 Hz) were applied intermittently, and single units exhibiting 
reliable laser-evoked responses with short latencies and low jitter were 
identified as GABAergic neurons (Fig. 3a-c and Extended Data 
Fig. 6a). The identified neurons consistently fired at high rates during 
REM sleep, low rates during NREM sleep, and variable rates during 
wakefulness’ (Fig. 3d and Supplementary Video 2). When the firing 
rates were averaged within each state, all 20 identified GABAergic 
neurons were most active during REM sleep (P < 0.005, Wilcoxon
rank-sum test; Fig. 3e–g, referred to as ‘REMmax’ neurons). The mean
firing rate of these neurons during REM sleep (34.6 spikes per second, 
95% confidence interval (CI) 23.6-48.0 spikes per second) was in fact 
higher than the laser stimulation frequency applied in our experiments 
(Figs 1 and 2), suggesting that their activity is sufficient for the 
observed REM promoting effects. Their firing rates increased gradu- 
ally over ~30s before the NREM to REM transition and decreased 
abruptly at the end of REM sleep (Extended Data Fig. 7). Such a 
temporal profile is well suited for both the induction and maintenance 
of REM sleep. 

Since during wakefulness the firing rates of vM GABAergic neurons
were highly variable, we analysed their relationship with different 
Figure 3 | Firing rates of identified vM GABAergic neurons across brain
states. a, Example recording of spontaneous and laser-evoked spikes from a
vM neuron. Blue ticks, laser pulses (30 Hz). b, Comparison between laser- 
evoked (blue) and averaged spontaneous (red) spike waveforms from this unit. 
c, Spike raster showing multiple trials of laser stimulation at 15 and 30 Hz.
d, Firing rates of an example vM GABAergic neuron. e, Left, mean firing rates
of the neuron in d during different brain states. Error bar, s.e.m. Right, firing
rates of the neuron during different wakeful behaviours. f, Firing rates of
20 identified vM GABAergic neurons during different brain states. Each line
shows firing rates of one unit; grey bar, average across units. g, Relative firing 
rates across brain states. The rates of each neuron were normalized by its 
maximum. All 20 neurons showed maximum firing rates during REM sleep 
(the difference was significant at P < 0.005 for all 20 neurons compared with 
NREM and wakefulness; Wilcoxon rank-sum test, post-hoc Bonferroni 
correction). h, Relative firing rates of 17 vM GABAergic neurons during 
different wakeful behaviours. 
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wakeful behaviours (classified on the basis of video recordings 
(Supplementary Video 2)). Of the 17 identified GABAergic neurons 
with recording periods encompassing multiple behaviours, 82% were 
most active during eating (n = 8) or grooming (n = 6) and much less 
active during moving or running (Fig. 3h and Extended Data Fig. 6c). 
Such a firing pattern is distinct from that of noradrenergic neurons in 
the locus coeruleus (a known postsynaptic target of vM GABAergic
neurons'*), which are strongly suppressed during eating and
grooming”. Interestingly, activation of vM GABAergic neurons during
wakefulness increased eating (Extended Data Fig. 8), indicating a causal 
relationship between their firing and the eating behaviour. Among the 
24 unidentified neurons recorded in the vM (likely to include non- 
GABAergic neurons), 71% were also REMmax”’””? (Extended Data 
Fig. 9a, b). Compared with the identified GABAergic neurons, however, 
the unidentified group showed a delayed firing rate increase before 
REM sleep onset (~10 s) and a more gradual decrease at the end of
REM sleep (Extended Data Fig. 9c, d). In addition, many of them were 
most active during running or moving (Extended Data Fig. 9b, right). 

Previous studies showed that vM neurons project both rostrally to 
the pons, midbrain, and other brain regions and caudally to the spinal 
cord’*”°. We wondered which pathway mediates the REM-promoting 
effect. The pons and midbrain are known to be crucial for REM 
sleep generation****'”. Injection of Cre-inducible AAV expressing 
ChR2-eYFP into the vM of GAD2-Cre mice revealed extensive axon 
projections to these regions (Fig. 4a). In particular, the ventrolateral 
periaqueductal grey (vlPAG) is thought to suppress REM sleep by
providing GABAergic inhibition to REM-promoting neurons””*. To
test the effect of vlPAG GABAergic neuron activity on brain states, we
injected Cre-inducible AAV expressing ChR2-eYFP into the vlPAG of
GAD2-Cre mice. Laser stimulation almost completely suppressed
onset of REM sleep (P < 0.001, bootstrap, n = 6) and shortened the
duration of REM episodes (laser, 37.1 ± 10.2 s; no laser, 82.8 ± 9.9 s;
P = 0.003, paired t-test, n = 5) while strongly promoting NREM sleep
(P < 0.001, bootstrap; Fig. 4c). Rabies-virus-mediated monosynaptic
retrograde tracing confirmed that vlPAG GABAergic neurons receive
direct inhibitory innervation from vM neurons (Fig. 4d, e). Furthermore,
optogenetic activation of vM GABAergic neuron axons within
the vlPAG caused large increases both in the initiation (P < 0.001,
bootstrap, n = 5) and in the duration (laser, 130.7 ± 32.9 s; no laser,
59.6 ± 9.6 s; P = 0.02, paired t-test, n = 5) of REM sleep episodes
(Fig. 4f), with magnitudes comparable to those found with stimulation 
of vM cell bodies (Figs 1 and 2). Although in principle stimulation of 
axon fibres in the vlPAG could antidromically activate neuronal cell
bodies in the vM and axon collaterals to other regions, simultaneous 
injections of retrograde tracers to both the pons and spinal cord 
showed little overlap between the labelled neurons (1.3%, Fig. 4b), 
indicating that the rostral and caudal projections arise largely from 
separate vM neuron populations. Thus, the rostral projections from 
vM GABAergic neurons exert a strong REM sleep-promoting effect, 
probably mediated at least in part by direct inhibition of the
REM-suppressing vlPAG GABAergic neurons.
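
The 1.3% overlap quoted above can be checked against the cell counts given in the legend of Fig. 4b. This assumes the denominator is the sum of the FluoroGold- and RetroBead-labelled counts. Counting double-labelled cells only once (2,089 cells) gives the same value to one decimal place.

```latex
\[
\frac{27}{1235 + 881} = \frac{27}{2116} \approx 0.013 = 1.3\%
\]
```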

Although the medulla is known to be involved in REM sleep gen- 
eration, previous studies have focused primarily on its role in inducing 
muscle atonia, through projections to the spinal cord*?”. Our findings 
indicate that vM GABAergic neurons constitute a critical component
of the core network, generating not only muscle atonia but also the
cortical activation associated with REM sleep’”****. The effect is
largely, if not completely, attributable to the rostral projections 
(Fig. 4), consistent with the previous finding that inactivation of the 
pons or transection at the pontomedullary junction blocks the muscle 
atonia evoked by stimulation of the medulla**’*®. The projection to the 
locus coeruleus is likely to provide GABAergic inhibition of noradrenergic
neurons’*”’, whose activity enhances wakefulness” and excitability
of spinal motor neurons”. Inhibition of the vlPAG GABAergic
neurons (Fig. 4c) should cause disinhibition of the REM-promoting 
neurons in the pons**", which may in turn trigger muscle atonia 
Figure 4 | Inhibition of vlPAG GABAergic neurons by vM projections
promotes REM sleep. a, Top, schematic showing injection of AAV expressing
ChR2-eYFP into the vM of a GAD2-Cre mouse. Bottom, fluorescence
image of vM axons in the pons and midbrain (position of coronal section
indicated by dashed line in schematic). Scale bar, 500 µm; green, eYFP; blue,
DAPI. b, Top, schematic showing simultaneous injections of RetroBeads (RB)
in the pons and FluoroGold (FG) in the spinal cord. Bottom, fluorescence
image of vM, showing neurons labelled by RB and FG. Among the 1235 FG-
and 881 RB-labelled neurons, only 27 were double labelled. Scale bar, 500 µm.
c, Left, probability of wake, NREM, or REM states before, during, and after
laser stimulation of vlPAG GABAergic neurons (20 Hz, 300 s; n = 6 mice).
Shading, 95% CI; blue bar, laser stimulation period (20 Hz, 300 s). Right, mean
REM durations with and without vlPAG stimulation (n = 5 mice). Each pair of
dots, data from one mouse. Error bar, s.d. d, Left, schematic showing
rabies-mediated trans-synaptic tracing. TVA, EnvA receptor; G, rabies glycoprotein;
RVdG, G-deleted rabies virus. Middle, fluorescence image of vlPAG in a
GAD2-Cre mouse. Scale bar, 500 µm. Bottom middle, enlarged view of region
in white box showing starter cells (yellow, expressing both GFP and mCherry,
arrowheads; scale bar, 20 µm). Right, rabies-labelled presynaptic neurons in
vM (same brain as in middle panel). Scale bar, 500 µm. e, Rabies-labelled
presynaptic neurons in vM are GABAergic and glycinergic. Lower panel,
enlarged view of region in white box, containing GFP-labelled neurons
expressing glycine (Gly, arrowheads; scale bar, 50 µm). GABA and glycine
coexist in a high percentage of vM neurons (Extended Data Fig. 10), suggesting
that the glycinergic rabies-labelled neurons in the vM are also GABAergic. In
total, 82% (185/226) of rabies-labelled cells were glycine positive (n = 3 mice).
f, Left, probability of wake, NREM, or REM states before, during, and after laser
stimulation of vM axons in vlPAG (20 Hz, 120 s; n = 5 mice). Shading, 95% CI.
Right, mean REM durations with and without vM axon stimulation (n = 5
mice). Error bar, s.d.


through their projections to the medulla and spinal cord, and cortical 
activation through their projections to midbrain and forebrain regions. 
The glutamatergic REM-active neurons in the dorsal pons® may also 
innervate the vM GABAergic neurons”, thus shaping their firing rates 
across different brain states (Fig. 3). 

In addition to the vM, the dorsal medulla also contains REMmax 
GABAergic neurons projecting to the locus coeruleus”, which may 
also contribute to the generation of REM sleep. The induction of 
NREM to REM, but not wake to REM, transitions by vM neurons 
points to a robust mechanism of wake maintenance—probably invol- 
ving orexin/hypocretin neurons*—that cannot be overridden by 
activating the vM GABAergic neurons. How this mechanism interacts 
with the circuit that generates REM sleep remains to be elucidated. 
Furthermore, a long-standing mystery is what functions are served by 
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REM sleep and its associated dreaming. The ability to control REM 
sleep at a high temporal precision, as demonstrated in this study, 
provides a powerful tool for studying its functions. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Animals. All experimental procedures were approved by the Animal Care and Use 
Committee at the University of California, Berkeley. Optogenetic manipulation 
experiments were performed in male or female GAD2-Cre (Jackson Laboratory 
stock 010702) and VGLUT2-Cre (016963) mice. Pharmacogenetic, rabies-mediated 
trans-synaptic tracing and anterograde tracing experiments were performed in male 
or female GAD2-Cre mice. For optrode recording experiments, male GAD2-Cre 
mice were crossed with female loxP-flanked-ChR2-eYFP mice (012569). 
Retrograde tracing experiments with RetroBeads (Lumafluor) and FluoroGold 
(Fluorochrome) were performed in male or female wild-type (C57) mice. 
Surgery. Adult (6- to 12-week-old) mice were anaesthetized with isoflurane (3% 
induction, 1.5% maintenance) and placed on a stereotaxic frame. Body temperature 
was kept stable throughout the procedure by using a heating pad. After asepsis, the 
skin was incised to expose the skull, and the overlying connective tissue was 
removed. For optogenetic experiments, a craniotomy (~1 mm diameter) was
made above the right cerebellum (6.8 mm posterior to the bregma, 0.6–0.9 mm
lateral). Virus (AAV2-EF1α-FLEX-ChR2-eYFP, AAV2-EF1α-FLEX-eYFP,
AAV2-EF1α-FLEX-eNpHR3.0-eYFP, AAV2-CAG-FLEX-ArchT-tdTomato,
produced by University of North Carolina Vector Core, or
AAV-DJ-EF1α-FLEX-ChR2-eYFP) was loaded into a sharp micropipette mounted on a Nanoject II
and injected slowly at a depth of 4.8 mm from the brain surface (600–800 nl). An
optical fibre (200 µm diameter) was inserted with the tip ~400 µm above the virus
injection site. To stimulate axon projections of vM GABAergic neurons in the
vlPAG, the optical fibre was implanted on top of the vlPAG (4.8 mm posterior to
the bregma, 0.6 mm lateral, 2.3 mm depth). To activate vlPAG GABAergic neurons,
AAV expressing ChR2-eYFP was injected into the vlPAG. For pharmacogenetic
experiments, AAV8-hSyn-FLEX-hM4D(Gi)-mCherry was injected bilaterally 
(300-800 nl on each side). For EEG and EMG recordings, a reference screw was 
inserted into the skull on top of the left cerebellum. EEG recordings were made 
from two screws on top of the left and right cortex (1 mm posterior to the bregma). 
Two EMG electrodes were inserted into the left and right neck muscles. The optical 
fibre, screws, and EEG/EMG electrodes were secured to the skull using dental 
cement. Without optogenetic manipulation, the sleep architecture quantified by 
the percentage, average duration and frequency of each brain state of these 
implanted animals was well within the range reported for mice without these 
implants, indicating that virus injection and optical fibre implantation did not cause 
large chronic changes in sleep architecture. For optogenetic experiments, data from 
animals where the tip of the optical fibre was not within the aimed location were 
excluded. For pharmacogenetic experiments, data from animals where virus 
expression was not restricted to the vM were excluded. 

For optrode recording experiments, the optrode assembly was inserted at a 
depth of 4.8 mm. A screw was attached to the frontal skull for EEG recording,
and an EMG electrode was inserted into the right neck muscle. The optrode 
assembly, screws, and EEG/EMG electrodes were secured to the skull using 
dental cement. 

For rabies-mediated retrograde tracing experiments, we first injected 100-150 nl 
of a combination of AAV5-CAG-FLEX-TC? (2.6 X 10’ vector genomes per mil- 
lilitre) and AAV8-CAG-FLEX-RG (1.3 X 10!” vector genomes per millilitre)** 
(Addgene 48332 and 48333, respectively) into the vlPAG (4.8 mm posterior to
the bregma, 0.6 mm lateral, 2.5 mm depth) of GAD2-Cre mice. Three weeks later, 
RVdG (titre 5 x 10° colony forming units per millilitre) pseudotyped with EnvA 
was injected into the vlPAG. AAV5-CAG-FLEX-TC? and AAV-CAG-FLEX-RG
were purchased from the University of North Carolina viral core, and RVdG was 
amplified in house from a stock derived from the Salk Institute viral core. Data from 
mice where starter cells were found outside the vlPAG were excluded. For controls,
we injected the same viruses in the same temporal sequence into wild-type mice 
(n = 2). No GFP-labelled cells were detected in the vM. 

For dual retrograde tracing experiments, we injected RetroBeads bilaterally into 
the pons (5.0-5.2 mm posterior to the bregma, 0.9 mm lateral) at several depths 
(3.0, 3.5, and 4.0 mm, 150-200 nl at each depth). The spinal cord was exposed by a 
unilateral laminectomy of a cervical vertebra. After a small incision in the dura, 
FluoroGold was injected (0.7 mm lateral, 1 mm depth, 400-700 nl). Histology and 
in vivo experiments were performed more than 10 and 21 days after injection, respectively.
Polysomnographic recordings. Animals were housed on a 12-h dark/12-h light 
cycle (light on between 7:00 and 19:00). Behavioural experiments were performed 
between 13:30 and 18:30. EEG and EMG electrodes were connected to flexible 
recording cables via a mini-connector, and recordings were made in the animal’s 
home cage placed in a sound-attenuation box. Recordings started after more than 
1 h of habituation. The signals were recorded with a TDT RZ5 amplifier, filtered
(1-750 Hz) and digitized at 1,500 Hz. Brain states were classified into NREM sleep, 
REM sleep, and wakefulness using custom-written MATLAB software. The clas- 
sification was performed without any information about the identity of the animal 
or laser stimulation timing. First, we calculated the power spectrum of the EEG 
and EMG using a 5 s sliding window, sequentially shifted by 2.5 s increments.
Next, we summed the EEG power in the ranges from 1 to 4 Hz and from 6 to 
12 Hz, yielding a time-dependent delta and theta power, respectively. For further 
analysis, we divided the theta by the delta power (theta/delta ratio). We also 
computed the total EMG power from 20 to 300 Hz. For each time point, we
determined the brain state using a threshold algorithm. A state was classified as 
NREM if the delta power was higher than its mean (averaged over the whole
recording session) and if the EMG power was lower than its mean plus one 
standard deviation. A state was classified as REM if (1) the delta power was lower
than the average, (2) the theta/delta ratio exceeded its mean by more than one
standard deviation, and (3) the EMG power was lower than its mean plus one
standard deviation. All remaining states were classified as wake. The wake state 
thus encompassed states with high EMG power (active awake), or low delta power 
without elevated EMG activity or theta/delta ratio (quiet awake). Finally, we 
manually verified the automatic classification to ensure that all states were cor- 
rectly assigned. 
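
The threshold rules above can be sketched in a few lines of Python. This is an illustrative sketch only, not the custom MATLAB software used in the study. It assumes that the band powers have already been computed per time bin, that NREM corresponds to delta power above its session mean (the direction expected for slow-wave sleep), that the REM criterion means a theta/delta ratio above its mean plus one standard deviation, and that the population standard deviation is used. The function name `classify_states` is hypothetical.

```python
from statistics import mean, pstdev


def classify_states(delta, theta, emg):
    """Label each time bin as 'NREM', 'REM' or 'wake' from band powers.

    delta, theta, emg: equal-length sequences of per-bin power values.
    Thresholds are session-wide means and standard deviations (assumed
    population s.d.), following the rules described in the text.
    """
    ratio = [t / d for t, d in zip(theta, delta)]
    delta_mean = mean(delta)
    ratio_thr = mean(ratio) + pstdev(ratio)  # assumed: elevated ratio marks REM
    emg_thr = mean(emg) + pstdev(emg)

    states = []
    for d, r, e in zip(delta, ratio, emg):
        low_emg = e < emg_thr
        if low_emg and d > delta_mean:       # assumed: high delta marks NREM
            states.append("NREM")
        elif low_emg and d < delta_mean and r > ratio_thr:
            states.append("REM")
        else:                                # active or quiet wakefulness
            states.append("wake")
    return states
```

Bins with high EMG power, or with low delta power but no elevated theta/delta ratio, fall through to wake, which mirrors the active and quiet awake categories described above. The manual verification step is not represented here.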

Behavioural monitoring. To classify different wakeful behaviours, we made video 
recordings (sampling rate, 5 Hz) using a camera placed on top of the cage. Wakeful 
behaviours were divided into four categories: eating, grooming, moving, and 
running. The classification was performed manually, on the basis of the video, 
EEG, and EMG recordings, using a custom-written graphical user interface (pro- 
grammed in MATLAB). The experimenter classifying the wakeful behaviours was 
blind to the timing of laser stimulation or the identity of the animal. 
Optogenetic manipulation. To test the role of vM GABAergic neurons in REM
sleep induction, we applied blue laser stimulation (473 nm, 5 mW at fibre tip) at
20 Hz (10 ms per pulse). Each trial lasted for 120 s (stimulation of vM GABAergic
neurons and axon projections) or 300 s (stimulation of vlPAG GABAergic
neurons), and the inter-trial interval was chosen randomly from a uniform
distribution between 15 and 25 min. For each animal we recorded at least two
5 h sessions.
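
A stimulation schedule of this kind could be generated as follows. This is a minimal sketch under stated assumptions: the inter-trial interval is taken to run from the end of one trial to the start of the next, the first trial starts one interval after the session begins, and the function name `laser_onsets` and its parameters are hypothetical.

```python
import random


def laser_onsets(session_s=5 * 3600, trial_s=120,
                 min_gap_s=15 * 60, max_gap_s=25 * 60, seed=0):
    """Return laser onset times (s) for one session.

    Gaps between trials are drawn from a uniform distribution between
    min_gap_s and max_gap_s. A trial is scheduled only if it fits
    entirely within the session.
    """
    rng = random.Random(seed)
    onsets = []
    t = rng.uniform(min_gap_s, max_gap_s)  # assumed: wait one gap before trial 1
    while t + trial_s <= session_s:
        onsets.append(t)
        t += trial_s + rng.uniform(min_gap_s, max_gap_s)
    return onsets
```

With the default values, a 5 h session holds between 11 and 17 trials of 120 s, depending on the intervals drawn.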

To test the role of vM or vlPAG GABAergic neurons in REM sleep maintenance,
we applied a closed-loop stimulation protocol. The animal’s brain state was clas- 
sified by real-time analysis of EEG/EMG recordings. As soon as REM sleep was 
detected, the laser was turned on with 50% probability, and turned off only when 
the REM episode ended. This allowed comparison of the durations of REM epi- 
sodes with and without laser stimulation within the same recording session. REM 
periods were assigned to either the experimental or control group according 
to a pseudo-random Bernoulli sequence generated before the start of the experiment.
For each animal at least two 5 h sessions were recorded. In ChR2-mediated
activation experiments, we used a blue laser (473 nm, 5 mW, 20 Hz). In
Arch/Halo-mediated inhibition experiments, we used a green laser (532 nm, 20 mW,
step pulse). 
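
The assignment of REM episodes to laser and no-laser conditions can be illustrated with a short Python sketch. It assumes a seeded generator stands in for the pre-generated pseudo-random sequence and that episode durations are available as a list in order of occurrence. The function names are hypothetical, and real-time REM detection is not modelled.

```python
import random
from statistics import mean


def bernoulli_sequence(n, p=0.5, seed=0):
    """Pre-generate n laser on/off decisions, each True with probability p."""
    rng = random.Random(seed)
    return [rng.random() < p for _ in range(n)]


def mean_rem_durations(durations_s, laser_flags):
    """Return (mean with laser, mean without laser) for one session.

    durations_s: REM episode durations in order of occurrence.
    laser_flags: the decision applied to each episode.
    Raises statistics.StatisticsError if either group is empty.
    """
    laser = [d for d, on in zip(durations_s, laser_flags) if on]
    control = [d for d, on in zip(durations_s, laser_flags) if not on]
    return mean(laser), mean(control)
```

Because both conditions are interleaved within the same session, slow drifts in sleep pressure affect the two groups about equally, which is the rationale given above for the within-session comparison.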

Pharmacogenetic manipulation. To inhibit vM GABAergic neurons, we injected
CNO dissolved in 0.1 ml vehicle solution (PBS with 0.5% dimethylsulfoxide 
(DMSO)) into GAD2-Cre mice expressing hM4Di in the vM, 20 min before the 
recording session. CNO was administered intraperitoneally at two doses (1 and
5 mg kg⁻¹ body weight) on different days. For the control experiment, we injected
the vehicle solution without CNO. For experimental and control recordings, ani- 
mals were subjected to the same behavioural manipulations. 

Optrode recording. To record the activity of vM GABAergic neurons, we used a
custom-built optrode”’, consisting of an optical fibre (0.1 or 0.2 mm diameter)
surrounded by 12 microwire electrodes (Stablohm 675) twisted into stereotrodes 
or tetrodes. The electrode tips were electroplated in a chloride-platinum solution 
to an impedance of ~600 kΩ. The optical fibre and electrodes were inserted into a
screw-driven microdrive. The optrode was slowly lowered in 25–50 µm steps to
search for light-responsive neurons. The signals were recorded using a TDT RZ5 
amplifier, filtered (0.3-8 kHz) and digitized at 25 kHz. Recordings were performed 
over a period of 1-2 months from each mouse. At the end of the experiment 
electrolytic lesions were made by passing a current (100 µA, 10 s) through one
or two electrodes to identify the end of the recording tract. 

Spike sorting. Spikes were sorted offline on the basis of the waveform energy and 
the first three principal components of a spike waveform on each stereotrode or 
tetrode channel. Single units were identified either manually using the software 
Klusters (http://neurosuite.sourceforge.net) or automatically using the software 
KlustaKwik (http://klustakwik.sourceforge.net). The quality of each unit was 
assessed by the presence of a refractory period and quantified using isolation 
distance and Lratio°’. Units with an isolation distance <19 or Lratio > 0.1 were
discarded. For units recorded with stereotrodes, the median values of isolation
distance and Lratio were 34.4 and 0.001. For units recorded with tetrodes, the
corresponding median values were 38.3 and 0.023. 

Identification of GABAergic units. To identify ChR2-expressing vM GABAergic
neurons, high-frequency laser pulse trains (15 and 30 Hz with duration of 1 and 
0.5 s, respectively) were delivered every 1 or 2 min. The laser power and pulse 
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duration were optimized to identify light-responsive neurons without changing 
the brain state. A unit was identified as GABAergic if spikes were evoked by laser 
pulses (both 15 and 30 Hz trains) at high reliability (>0.6), short first-spike latency 
(<6 ms), and small jitter (<2 ms), and if the waveforms of the laser-evoked and 
spontaneous spikes were highly similar (correlation coefficient >0.9; Extended 
Data Fig. 6a). To compute the mean firing rate of each neuron in each brain state, 
spikes during the laser pulse trains were excluded. Twenty out of twenty-one 
identified GABAergic units had their recording periods encompassing all three 
brain states. For 17 units the recording periods encompassed all four wakeful 
behaviours. 
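
The identification criteria can be expressed as a simple predicate. The sketch below assumes that reliability, first-spike latency and jitter have already been measured for each pulse-train frequency, and that waveform similarity is a Pearson correlation between the mean laser-evoked and mean spontaneous waveforms. The function names and the dictionary keys are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def is_light_tagged(trains, evoked_wave, spont_wave):
    """Apply the tagging thresholds stated in the text.

    trains: one dict per pulse-train frequency (for example 15 and 30 Hz)
            with keys 'reliability', 'latency_ms' and 'jitter_ms'.
    """
    trains_ok = all(
        t["reliability"] > 0.6 and t["latency_ms"] < 6 and t["jitter_ms"] < 2
        for t in trains
    )
    return trains_ok and pearson(evoked_wave, spont_wave) > 0.9
```

A unit must pass the thresholds for every train supplied, matching the requirement that both the 15 and 30 Hz trains evoke reliable, short-latency spikes.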

In total we recorded from 21 identified GABAergic units and 24 unidentified 
units. However, the ratio (21/24) is probably affected by sampling bias and thus 
may not reflect the actual percentage of ChR2-labelled GABAergic neurons in the 
vM. In general, the success rate for finding a neuron that is reliably activated by 
laser is very low. To maximize the rate of data collection from identified neurons, 
we spent 1–2 h on recording (instead of moving the electrode to a new location)
only if we found at least one unit that appeared to be activated by the laser pulses.
As a result, most of the unidentified units were recorded simultaneously with other
laser-driven, putative GABAergic units. 

Histology and immunohistochemistry. Mice were deeply anaesthetized and 
transcardially perfused with 0.1 M PBS followed by 4% paraformaldehyde (w/v)
in PBS. For fixation, brains were kept overnight in 4% paraformaldehyde. For 
cryoprotection, brains were placed in 30% sucrose (w/v) in PBS solution for at least 
two nights. After embedding and freezing, brains were sectioned into 20, 30, or 
60 µm coronal slices using a cryostat. For immunohistochemistry, non-specific
binding sites were blocked by incubating the sections in 2% donkey or goat serum 
in PBST (0.3% Triton X-100 in PBS). Brain sections were stained with an anti- 
choline acetyltransferase antibody (AB144, Millipore; 1:250) for cholinergic neu- 
rons, and anti-glycine (AB5020, Millipore, 1:100) for glycinergic neurons. To 
amplify the fluorescence of axon fibres expressing ChR2-eYFP, we used an antibody
against GFP (A11122 or A11120, Life Technologies, 1:1,000). Brain sections were 
incubated with the primary antibody in blocking solution for two nights. Species- 
specific secondary antibodies conjugated with red or green Alexa fluorophores 
(donkey anti-mouse, A21202, 1:1,000; goat anti-rabbit, A11008, 1:1,000; donkey 
anti-goat, A11058, 1:250) in PBS were applied for 2 h. Finally, slides were washed
for 2 h in PBS and mounted with Fluoromount. Fluorescence images were taken
using a confocal microscope (LSM 710 AxioObserver Inverted 34-Channel 
Confocal) and Nanozoomer. 

Statistics. For optogenetic experiments, GAD2-Cre mice were randomly 
assigned to control (injected with AAV expressing eYFP) and experimental 
groups (injected with AAV expressing ChR2-eYFP, eNpHR3.0-eYFP, or
ArchT-tdTomato). Experimental and control animals were subjected to exactly the same
surgical and behavioural manipulations. No randomization was used for rabies- 
mediated, anterograde or dual retrograde tracing experiments, pharmacogenetic 
recordings and optrode recordings. Unless stated otherwise, investigators were not 
blinded to animal identity and outcome assessment. 

All statistical tests (paired t-test, Wilcoxon rank-sum test, Wilcoxon signed- 
rank test, bootstrap) were two-sided. For both paired and unpaired tests, we 
ensured that the variances of the data were similar between the compared groups. 
For t-tests, we verified that the data were normally distributed using Lilliefors test 
for normality. 

The 95% CIs for brain state probabilities were calculated using a bootstrap
procedure. For an experimental group of n mice, with mouse i comprising m_i
trials, we calculated the CI as follows: we repeatedly resampled the data by randomly
drawing for each mouse m_i trials (random sampling with replacement). For
each of the 10,000 iterations, we recalculated the mean probabilities for each brain 
state across the n mice. The lower and upper CIs were then extracted from the 
distribution of the resampled mean values. To test whether a given brain state was 
significantly modulated by laser stimulation, we calculated for each bootstrap 
iteration the difference between the mean probabilities during laser stimulation 
and the preceding period of identical duration. From the resulting distribution of 
difference values, we then calculated a P value to assess whether laser stimulation 
significantly modulated the brain state. 
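
The resampling scheme can be sketched as follows. This is an illustrative sketch, not the analysis code used in the study. It assumes each trial is summarized by one number (for example, the fraction of time spent in a state, or the laser-minus-baseline difference for that trial), that percentile intervals are used for the CI, and that the two-sided P value is twice the smaller tail of the bootstrap difference distribution. The text does not specify these details, and all function names are hypothetical.

```python
import random
from statistics import mean


def bootstrap_means(trials_per_mouse, n_iter=10000, seed=0):
    """Resample trials within each mouse, then average across mice.

    trials_per_mouse: one list of per-trial values for each mouse.
    Returns n_iter bootstrap estimates of the across-mouse mean.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_iter):
        mouse_means = [mean(rng.choices(trials, k=len(trials)))
                       for trials in trials_per_mouse]
        estimates.append(mean(mouse_means))
    return estimates


def percentile_ci(samples, level=0.95):
    """Lower and upper bounds of a percentile confidence interval."""
    s = sorted(samples)
    k = int(len(s) * (1 - level) / 2)
    return s[k], s[len(s) - 1 - k]


def two_sided_p(diffs):
    """Assumed P value: twice the smaller tail fraction around zero."""
    n = len(diffs)
    below = sum(d <= 0 for d in diffs) / n
    above = sum(d >= 0 for d in diffs) / n
    return min(1.0, 2 * min(below, above))
```

Passing per-trial differences (laser period minus the preceding period of equal duration) to `bootstrap_means` and then to `two_sided_p` gives a test of whether stimulation shifted the state probability. With 10,000 iterations the smallest nonzero P value this estimator can return is 0.0002.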
Sample sizes. To determine the sample sizes of the experimental groups for
optogenetic and pharmacogenetic experiments, for each group we first performed 
pilot experiments with two or three mice. Given the strength of the effect and the 
variance across this group, we then predicted the number of animals required to 
reach sufficient statistical power. To determine the sample size (number of units) 
for optrode recordings, we first recorded from two animals. Given the success rate 
of finding identified units and the homogeneity of units in the initial data set, we 
set a target sample size of 20 units. For rabies-mediated, anterograde, or dual 
retrograde tracing experiments, the selection of the sample size was based on 
numbers reported in previous studies. Otherwise, no statistical methods were used 
to predetermine sample size. 
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Extended Data Figure 1 | Comparison of spontaneous and laser-induced 
REM sleep. a, Mean EEG spectrogram and EMG amplitude before and
after laser onset (averaged across all trials with laser onset falling on NREM
sleep). b, Mean EEG spectrogram and EMG amplitude before and after 
spontaneous REM onset outside laser stimulation periods (only REM episodes 
with duration >70s were included). c, Comparison of EEG power spectra 
during spontaneous (grey) and laser-induced (blue) REM sleep. Blue shading, 
s.e.m. for laser-induced REM sleep. 
Extended Data Figure 2 | Effect of laser stimulation on brain states in eYFP
control mice. a, Brain states in all trials from five mice aligned by time of
laser stimulation (top) and probability of wake, NREM, or REM states before,
during, and after laser stimulation (bottom). Shading, 95% CI. Blue bar, period
of laser stimulation. Laser stimulation caused no significant change in the
probability of any brain state (P > 0.34, bootstrap). b, Similar to a, for trials in
which laser onset fell on NREM sleep. c, Trials in which laser onset fell on
wakefulness.
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Extended Data Figure 3 | Optogenetic activation of vM glutamatergic 
neurons induces wakefulness. a, Probability of wake, NREM, or REM states 
before, during, and after laser stimulation (20 Hz, 120s) in VGLUT2-Cre 
mice injected with AAV expressing ChR2-eYFP into the vM (n = 3 mice). 
Shading, 95% CI. Blue bar, period of laser stimulation. Laser stimulation caused 
a significant increase in wakefulness (P < 0.001, bootstrap) and decrease in
NREM sleep (P < 0.001). b, Mean durations of REM sleep episodes with
and without laser stimulation. Each pair of dots represents data from one 
mouse. Laser stimulation shortened the duration of REM sleep episodes 
(n = 3 mice, P = 0.008, paired t-test). Error bar, s.d. c, Probability of wake (left), 
NREM (middle), and REM (right) states before, during, and after laser 
stimulation (20 Hz, 120s) at different laser powers (colour coded). 
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Extended Data Figure 4 | Effect of laser stimulation of vM GABAergic 
neurons on the transition probability between each pair of brain states in 
ChR2 and eYFP control mice. a-f, ChR2 control mice. a, Probability of 
NREM (N) to REM (R) state transition within each 20 s period in ChR2
mice. Blue shading, period of laser stimulation (20 Hz, 120 s). Error bar, s.d.
(bootstrap, n = 6 mice). Probability of baseline transition (red dashed line)
was computed after excluding the laser stimulation period. The probability
during laser stimulation was significantly higher than the baseline (P = 0.02,
bootstrap). b, Similar to a, for NREM to wake (W) transition. The probability
during laser stimulation was not significantly different from baseline
(P = 0.24). c, REM to wake, probability during laser stimulation was significantly
lower than baseline (P < 0.001), consistent with the effect of vM
GABAergic neurons on prolonging REM duration (Fig. 2). d, REM to NREM,
which rarely occurs in rodents. Laser stimulation caused no significant effect
(P > 0.99). e, Wake to REM, which rarely occurs in normal mice. Laser
stimulation had no significant effect (P > 0.99). f, Wake to NREM. Laser
stimulation caused a significant reduction in the transition probability
(P < 0.001), indicating that during wakefulness vM GABAergic neuron activity
has a wake-maintenance effect. g-l, Similar to a-f, for eYFP control mice.
Laser stimulation had no significant effect on any transition probability
(P > 0.05).
Extended Data Figure 5 | Pharmacogenetic inactivation of vM GABAergic
neurons reduces REM sleep. a, Brain states in a control (vehicle injection)
and a CNO session from an example mouse. The recording session started
20 min after vehicle or CNO injection. b, Probability of each brain state during
the first 2.5 h of the recording session, after injection of vehicle (grey) or two
dosages of CNO (different shades of blue). Error bar, s.e.m. (n = 6 mice).
*P < 0.05; **P < 0.01; one-way analysis of variance with post hoc Dunnett’s
test. c, Similar to b, but during the second half of the recording session (2.5-5 h).
There was no significant difference between control and CNO at any dosage
(P > 0.12). d, Latency of first REM sleep episode (from the beginning of
each recording session). e, Frequency of REM episodes during the first 2.5 h of
the recording session. f, Duration of REM episodes during the first 2.5 h
of the session. The reduction of REM sleep caused by pharmacogenetic
inactivation of vM GABAergic neurons appears to be due to the reduction
of frequency rather than duration of REM episodes.
Extended Data Figure 6 | Optogenetic identification of vM GABAergic
neurons. a, Left, reliability and temporal jitter of laser-evoked spikes in
identified (blue) and unidentified (grey) units recorded in the vM. Note that 
the identified and unidentified units form two distinct clusters, with high 
reliability and low jitter for identified units. Dark/light symbols, data during 
30/15 Hz laser stimulation. Middle, distribution of delays of laser-evoked 
spiking for 21 identified GABAergic neurons. Delay is defined as timing of the 
first spike after each laser pulse. Right, distribution of correlation coefficient 
between laser-evoked and spontaneous spike waveforms for all 21 identified 
vM GABAergic neurons. b, Fluorescence image of a coronal section showing 
the position of an electrolytic lesion at the end of the optrode tract (arrow). 
Blue, DAPI staining. c, Positions of the 21 identified vM GABAergic neurons
from 5 mice. Each dot indicates one neuron. All 20 neurons with recording 
periods encompassing all three brain states showed maximal firing rates during 
REM sleep. The wakeful behaviour during which the neuron showed the 
maximal firing rate is colour-coded. Black, neurons for which the recording 
period did not include all wakeful behaviours. Schemes of brain sections 
adapted from Allen Mouse Brain Atlas (Website: © 2015 Allen Institute for
Brain Science. Allen Mouse Brain Atlas [Internet]. Available from:
http://mouse.brain-map.org).
Extended Data Figure 7 | Activity of vM GABAergic neurons at REM sleep
onset and offset. a, Mean firing rates of vM GABAergic neurons at REM
onset. Shading, s.e.m. Dark blue, period in which firing rate was significantly
higher than baseline (P < 0.05, Wilcoxon signed-rank test); baseline was
defined as the average firing rate during the 10 s intervals 60 s before and after
REM sleep. b, Mean firing rates at REM offset.
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Extended Data Figure 8 | Effect of laser stimulation of vM neurons on 
several wakeful behaviours. a, Probability of moving, running, eating, and 
grooming before, during, and after laser stimulation (20 Hz, 120 s) in GAD2-
Cre mice injected with AAV expressing ChR2-eYFP into the vM (n = 8 mice). 
Unclassified behaviours are not shown. Blue bar, period of laser stimulation. 
Laser stimulation caused a significant increase in eating (P = 0.008, Wilcoxon 
signed-rank test). b, Similar to a, but in control mice expressing eYFP
(n = 4 mice).
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Extended Data Figure 9 | Firing rates of unidentified vM neurons. a, Firing 
rates of unidentified units in the three brain states. Each line represents data 
from one neuron. Grey bar represents average over units (n = 24). b, Left,
relative firing rates of the units across brain states. The firing rates of each unit 
were normalized by its maximum. Right, normalized firing rates across 
different wakeful behaviours. c, Mean firing rates of unidentified vM neurons at 
REM onset. Shading, s.e.m. Dark blue, period in which firing rate was 
significantly higher than baseline (P < 0.05, Wilcoxon signed-rank test).
d, Mean firing rates of unidentified vM neurons at REM offset.
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Extended Data Figure 10 | Co-expression of GABA and glycine in vM 
neurons. a, Fluorescence image of ChR2-eYFP expressing neurons in the vM
of a GAD2-Cre mouse injected with Cre-inducible AAV. b, Immunohistochemical
staining for glycine. c, Superposition of eYFP expression and glycine
staining. In total, 94% (273/289) of eYFP-expressing cells were glycine positive 
(n = 3 mice). 
Encoding of action by the Purkinje cells of the cerebellum

David J. Herzfeld, Yoshiko Kojima, Robijanto Soetedjo & Reza Shadmehr


Execution of accurate eye movements depends critically on the
cerebellum, suggesting that the major output neurons of the cerebellum,
Purkinje cells, may predict motion of the eye. However, this
encoding of action for rapid eye movements (saccades) has remained
unclear: Purkinje cells show little consistent modulation with
respect to saccade amplitude or direction, and critically, their
discharge lasts longer than the duration of a saccade. Here we
analysed Purkinje-cell discharge in the oculomotor vermis of behaving
rhesus monkeys (Macaca mulatta) and found neurons that
increased or decreased their activity during saccades. We estimated 
the combined effect of these two populations via their projections to 
the caudal fastigial nucleus, and uncovered a simple-spike popu- 
lation response that precisely predicted the real-time motion of 
the eye. When we organized the Purkinje cells according to each 
cell’s complex-spike directional tuning, the simple-spike population 
response predicted both the real-time speed and direction of saccade 
multiplicatively via a gain field. This suggests that the cerebellum 
predicts the real-time motion of the eye during saccades via the 
combined inputs of Purkinje cells onto individual nucleus neurons. 
A gain-field encoding of simple spikes emerges if the Purkinje cells 
that project onto a nucleus neuron are not selected at random but 
share a common complex-spike property. 

Previous studies have focused on bursting activity of Purkinje cells
during saccades and found no consistent modulation with saccade
amplitude, speed or direction. A recent simulation suggested
that Purkinje cells that pause during saccades may be important
in understanding the responses observed in the deep cerebellar nucleus 
neurons. The main question that we wished to address was how the 
Purkinje cells encode the real-time motion of the eye. 

We analysed simple-spike activity of 72 Purkinje cells in the oculo- 
motor vermis (OMV, cerebellar lobules VI and VII) of five monkeys 
during saccades. The population included cells that exhibited increased 
activity (bursting; n = 39, Fig. 1a) or decreased activity (pausing; n = 33, 
Fig. 1b). Consistent with previous reports, most neurons were poorly
modulated by saccade amplitude (Fig. 1c and Extended Data Fig. 1); 
however, the mean firing rate of burst cells (but not pause cells) increased 
significantly with saccade peak speed (Fig. 1d, P< 10° °). Previous work 
had demonstrated that the population response encoded additional saccade-related
information that was not reliably present in the responses of
individual neurons. To examine the population response, we measured
change in firing rates (from baseline) for the bursting and pausing
cells during slow (400° s⁻¹) and fast (650° s⁻¹) saccades (Fig. 1e), pooled
across all directions. The onset of change in firing rates in both populations
generally led saccade onset by more than 50 ms. The termination of
activity was also significantly later than the saccade: a 650° s⁻¹ saccade
was 38 ± 1.2 ms in duration (mean ± s.e.m.), whereas activity of burst and
pause cells persisted for more than 100 ms. Given that the cerebellum is
thought to have a critical role in termination of ipsiversive saccades, it is
unlikely that separate populations of burst or pause Purkinje cells control
the motion of the eye, since their activity persists for much longer than the
saccade.


Figure 1 | Population of burst and pause Purkinje cells together predict eye
speed in real time. a, b, Perisaccade histograms for a bursting (a) and
pausing (b) Purkinje cell during saccades of various speeds and directions (red
arrow). The trace on the top row is saccade speed. The grey arrow indicates
saccade end. c, d, Mean firing rates over the duration of saccade computed
across all directions. Changes in speed produced an increase in the firing rate of
the burst cells but not the pause cells. e, Change in firing rates (with respect
to baseline) of the bursting and pausing Purkinje cells for two saccade
speeds. Grey bars are onset and termination of the saccade (width is s.e.m.).
f, The total rate of simple spikes produced by a random selection of 50
Purkinje cells.

Purkinje cells project to the caudal fastigial nucleus (cFN), where
about 50 Purkinje cells converge onto a cFN neuron. For each
Purkinje cell we computed the probability of a simple spike in 1-ms
time bins during saccades of a given peak speed, averaged across all
directions. We then chose 50 Purkinje cells at random and computed
the total number of simple spikes generated by the population at each
millisecond, resulting in an estimate of the rate of presynaptic spikes
converging onto a cFN cell. The results (Fig. 1f) revealed a real-time
encoding of the speed of the eye: the peak of the activity preceded peak
speed, increased in magnitude when speed increased, and returned to
baseline just before saccade termination (R² at the optimal delay,
400° s⁻¹: R² = 0.52, P<10 77; 650° s⁻¹: R² = 0.62, P<10 *°). It
appeared that the simple spikes of the pause and burst cells combined
to predict motion of the eye.
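
The random-population estimate described above can be sketched as follows. This is a minimal illustration and not the authors' analysis code: the function name `population_rate`, the input layout (one list of per-millisecond spike probabilities per cell) and the conversion to spikes per second are assumptions made for the example.

```python
import random

def population_rate(spike_prob, n_cells=50, seed=0):
    """Summed simple-spike rate of a randomly chosen subset of cells.

    spike_prob[i][t] is the probability that cell i fires a simple spike
    in 1-ms bin t (hypothetical input format). The result is the expected
    number of presynaptic spikes per bin, scaled to spikes per second.
    """
    rng = random.Random(seed)
    # Draw n_cells distinct cells from the recorded pool.
    chosen = rng.sample(range(len(spike_prob)), n_cells)
    n_bins = len(spike_prob[0])
    # 1-ms bins, so multiplying by 1000 converts spikes/bin to spikes/s.
    return [1000.0 * sum(spike_prob[i][t] for i in chosen)
            for t in range(n_bins)]

# Toy pool of 72 cells, each with a flat 5% spike probability per bin.
toy_pool = [[0.05] * 10 for _ in range(72)]
toy_rate = population_rate(toy_pool)
```

With real data, burst and pause cells would contribute opposite deviations from baseline, and it is their sum that the text compares with eye speed.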

Let us hypothesize that the Purkinje cells that project to a nucleus 
neuron are not selected randomly, but are organized by their inputs 
from the inferior olive. That is, suppose that the olive projections
divide the Purkinje cells into clusters where each cluster of Purkinje cells 
projects onto a single nucleus neuron. The input from the olive produces 
complex spikes in the Purkinje cells. We found that if we organized the 
simple spikes of the Purkinje cells based on each cell’s complex-spike 
properties, additional features of the population activity were unmasked. 

We measured complex-spike properties of each Purkinje cell by 
inducing a post-saccadic error through displacement of the target 
during the saccade, and then measured the probability of complex 
spikes as a function of the direction of this error (Fig. 2 and 
Supplementary Information section 2). For each Purkinje cell, the 
direction of error that produced the largest probability of complex 
spikes during the 50-200-ms post-saccade period was labelled as 
CS-on, and the opposite direction was labelled as CS-off (Extended
Data Fig. 2). We then made the assumption that the Purkinje cells that 
projected onto a nucleus neuron all had the same CS-on direction 
(Fig. 3a). Under this assumption, we computed the rate of presynaptic 
simple spikes that a nucleus neuron would receive from the cluster of 
Purkinje cells (Supplementary Information section 3). We did this by 
convolving each Purkinje cell simple-spike train with a normalized
Gaussian of 2.5 ms standard deviation, approximating the temporal
characteristics of the inhibition produced in the nucleus neuron due
to a simple spike in the Purkinje cell.
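
The smoothing step can be illustrated with a short sketch. It assumes spike trains binned at 1 ms and a kernel truncated at four standard deviations; the truncation width and the function names are choices made for this example, not details given in the text.

```python
import math

def gaussian_kernel(sd_ms=2.5, dt_ms=1.0, width_sd=4.0):
    """Unit-area Gaussian kernel sampled every dt_ms (truncation is an assumption)."""
    half = int(math.ceil(width_sd * sd_ms / dt_ms))
    k = [math.exp(-0.5 * ((i * dt_ms) / sd_ms) ** 2)
         for i in range(-half, half + 1)]
    total = sum(k)
    return [v / total for v in k]

def smooth_spike_train(spikes, kernel):
    """Convolve a binned spike train with the kernel, keeping the same length."""
    half = len(kernel) // 2
    n = len(spikes)
    out = [0.0] * n
    for t, count in enumerate(spikes):
        if count:
            # Spread each spike over neighbouring bins, clipping at the edges.
            for j, w in enumerate(kernel):
                u = t + j - half
                if 0 <= u < n:
                    out[u] += count * w
    return out

kernel = gaussian_kernel()
train = [0] * 101
train[50] = 1  # a single simple spike at t = 50 ms
smoothed = smooth_spike_train(train, kernel)
```

Because the kernel has unit area, each spike still contributes exactly one spike to the smoothed trace; multiplying by 1000 converts the 1-ms bin values to spikes per second.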

Figure 3b shows the change in population response from the base- 
line level when a saccade was made in the same direction as CS-off. The 
response rose above baseline before saccade onset, peaked before peak 
speed, and then returned to near baseline. The peak response scaled 
robustly with saccade amplitude (Fig. 3c, R² = 0.93, P< 10°). We
observed a strong correspondence between the real-time population 
response and the real-time speed (Fig. 3d, lower plot, and Extended 
Data Fig. 3). The population response preceded eye speed by an average
of 21.2 ± 0.4 ms (correlation analysis in the CS-off direction,
mean ± s.e.m.). Peak population response precisely predicted peak
speed (Fig. 3e, R² = 0.98, P< 10°”).
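
One common way to estimate how far a population response leads eye speed is to shift one trace against the other and keep the shift with the highest correlation. The sketch below shows that idea under stated assumptions: both traces are sampled at 1 ms, and the search range of 0 to 50 ms is arbitrary. The text says only that a correlation analysis was used, so this should not be read as the authors' exact procedure.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def best_lead(response, speed, max_lead=50):
    """Return (lead_ms, r): the lead at which response[t] best matches speed[t + lead]."""
    best_lead_ms, best_r = None, -2.0
    for lead in range(max_lead + 1):
        r = pearson(response[:len(response) - lead], speed[lead:])
        if r > best_r:
            best_lead_ms, best_r = lead, r
    return best_lead_ms, best_r

# Toy traces: identical bumps, with the response peaking 21 ms before the speed.
def bump(centre):
    return [math.exp(-0.5 * ((t - centre) / 8.0) ** 2) for t in range(200)]

lead_ms, r = best_lead(bump(79), bump(100))
```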

We took advantage of natural variability in saccades to test further 
the relationship between the population response and speed. We 
sorted all 10° saccades (direction CS-off) according to peak speed 
(Fig. 3f) and found that despite the constant amplitude, the population
response precisely predicted the actual peak speed of the eye (Fig. 3g,
R² = 0.96, P< 10’). Therefore, when the simple spikes were organized
according to each cell’s complex-spike directional preference,
the population response for saccades of constant direction predicted 
nearly all of the variability in saccade peak speed. 

No previous work, to our knowledge, had revealed how direction of 
a saccade is encoded in the activity of Purkinje cells. For the burst 
and pause cells, the mean and peak firing rates during the saccade 
did not vary as a function of direction (Extended Data Fig. 4). 
However, organizing the population response according to each 
Purkinje cell’s complex-spike tuning preference revealed a clear 
encoding of direction: the peak response was greater if the saccades 
were in the same direction as CS-off as compared to CS-on (Fig. 4a, 
t-test, P< 10 '®). Indeed, the peak population response rose linearly 
as a function of peak speed in both directions, but with a larger gain 
when the saccade direction was congruent with CS-off (Fig. 4b,
repeated measures analysis of variance (ANOVA) with main effects
of peak speed, P< 10 1°; CS-direction P< 1077; and a speed by
CS-direction interaction, P< 10° 1°).

Figure 2 | Determination of complex-spike properties of Purkinje cells.
a, Response of a Purkinje cell during the 250-ms period after completion of a
saccade (simple spikes are grey; complex spikes are red). CS-on was
determined via a back-step paradigm in which the target was jumped
(unfilled target to filled target) during saccade execution. Black arrow
indicates saccade vector, red arrow indicates error vector. We computed the
probability of complex spikes in the 50-200-ms period after saccade
termination. b, The probability of a complex spike (Pr(CS)) as a function of
the direction of the error vector. For this neuron, the highest probability (CS-on)
occurred when the error vector was in direction −45°. The direction of
CS-off for this cell was 135°.

To examine the effects of saccade direction more closely, we plotted 
the population response across saccade directions with respect to 
CS-on (Fig. 4c). We found that the population response was highest 
for saccades made in the CS-off direction, with an encoding of direction 
that resembled a cosine function (Fig. 4d). Therefore, the combined 
activity of burst and pause cells, but not the activity of either population 
individually (Extended Data Fig. 4), aligned to CS-off, produced a 
population response that exhibited gain-field encoding: the magnitude 
of the population response increased linearly with speed, and was 
cosine-tuned in direction, with a multiplicative interaction between 
speed and direction. The rate of simple spikes converging onto cFN,
represented by s(t), predicted the real-time motion of the eye
(Supplementary Information section 4 and Extended Data Fig. 5) according to:

$$s(t) = |\dot{x}(t+\Delta)|\, g(\theta, \theta_{CS}) + c$$
$$g(\theta, \theta_{CS}) = a\cos(\theta - \theta_{CS}) + b \qquad (1)$$

In equation (1), $|\dot{x}(t+\Delta)|$ represents the magnitude of the eye velocity
vector (the time derivative of eye position, x) at time $t+\Delta$ (where
$\Delta = 19$ ms), b and c are baseline offsets, a is a scaling factor, $\theta$ is saccade
direction, and $\theta_{CS}$ is direction of CS-off for that cluster of Purkinje cells.
The resulting gain-field encoding of eye motion is depicted in Fig. 4e. 
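
Equation (1) can be evaluated directly to see the multiplicative interaction between speed and direction. In the sketch below the parameter values a, b and c are placeholders chosen for illustration, not fitted values from the paper, and the speed argument is assumed to be already sampled at time t + Δ.

```python
import math

def gain(theta_deg, theta_cs_deg, a, b):
    """Cosine-tuned gain g(theta, theta_CS) = a*cos(theta - theta_CS) + b."""
    return a * math.cos(math.radians(theta_deg - theta_cs_deg)) + b

def population_response(speed, theta_deg, theta_cs_deg, a, b, c):
    """s(t) = |xdot(t + Delta)| * g(theta, theta_CS) + c for one time point."""
    return abs(speed) * gain(theta_deg, theta_cs_deg, a, b) + c

# Placeholder parameters (hypothetical): a = 1.0, b = 2.0, c = 0.5.
params = dict(a=1.0, b=2.0, c=0.5)
for speed in (400.0, 650.0):
    toward_cs_off = population_response(speed, 0.0, 0.0, **params)
    toward_cs_on = population_response(speed, 180.0, 0.0, **params)
    print(speed, toward_cs_off, toward_cs_on)
```

The response grows linearly with speed in both directions, but the slope is a + b for saccades in the CS-off direction and b − a in the CS-on direction, which is the gain-field signature described in the text.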

We next addressed the question of how the activity of individual 
cells produced this directional encoding in the population response. 
The main contributors were the pause cells, which started their pause 
approximately 10 ms earlier when the saccade was in the CS-on dir- 
ection (Fig. 4f), a change that was independent of saccade speed 
(Extended Data Fig. 6). This subtle shift in the timing of spikes pro- 
duced an increase of the population response when saccade direction 
changed from CS-on to CS-off (Fig. 4a). 

We found that the anatomical distribution of Purkinje cells, as 
labelled by their CS-off direction, was not random, but lateralized
(Extended Data Fig. 7), confirming previous anatomical studies
suggesting that olivary projections are contralateral. Purkinje cells
with rightward CS-off were more likely to be on the right side of the 
cerebellum (t-test, P< 10 “). This indicates that saccades made in 
the same direction as CS-off were typically ipsiversive, whereas saccades 
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congruent with CS-on were contraversive. In contrast, pause and burst 
cells were uniformly distributed across the cerebellum (P > 0.4). 

Our results rely critically on our hypothesis that Purkinje cells
organize into clusters with roughly equal numbers of pause and burst
cells, all with a common complex-spike tuning preference (Fig. 3a).
If, contrary to our hypothesis, pause and burst cells organized into
separate clusters, the population response would not predict the real-time
motion of the eye (Fig. 1e). Similarly, if each cluster was not
composed of roughly equal numbers of pause and burst cells, the
population response could not predict the real-time speed of the
eye (Extended Data Fig. 8 and Supplementary Information section 5).
The fact that burst and pause cells were distributed uniformly across the
recording locations, and not lateralized as we found with the complex-spike
tuning properties, suggests that a cluster is composed of both burst
and pause cells. Finally, if we ignored the complex-spike properties of the
Purkinje cells, and made the typical assumption that simple spikes were
sufficient to uncover the coordinate system of encoding motion, then
the gain-field representation of speed and direction would disappear
(Extended Data Fig. 9 and Supplementary Information section 6).

Figure 3 | A cluster of Purkinje cells, organized by their complex spikes,
produced a population response that predicted in real time the
motion of the eye. a, Hypothesized organization
of the oculomotor vermis. To compute a
population response, we measured the simple
spikes of each Purkinje cell as a function of saccade
direction with respect to the CS-on direction of
that cell. For the Purkinje cells shown here, the CS-on
is an error vector to the left (arrow). b, Change
in population response (with respect to baseline)
as a function of saccade amplitude in 0.5° bins, for
saccades in the CS-off direction. Data in the
amplitude axis were smoothed by a first-order
Savitzky-Golay filter with a width of three bins.
Bottom plot shows the population response for
three representative amplitudes. c, Peak population
response increased linearly with saccade
amplitude. P values indicate significant linear
correlation. d, Population response as a function
of saccade peak speed. Bottom plot shows
representative responses with their corresponding
speed traces. e, Peak population response increased
linearly with saccade peak speed. f, Population
response for 10° saccades (±1°), as a function of
saccade peak speed. Bottom plot shows the
population response for slow, medium and fast
saccades of 10° amplitude. g, Peak population
response increased linearly as a function of
peak speed even for a fixed magnitude saccade.
Error bars are s.e.m.

Figure 4 | Population response of Purkinje cells predicted saccade speed and
[…] direction CS-on and CS-off. The population response is larger when the
saccade is in the CS-off direction. b, Peak population response grew linearly
with saccade speed, but had a higher gain for saccades in the CS-off direction.
c, Real-time population response as a function of saccade direction relative
to CS-on. Data smoothed as in Fig. 3b. d, Peak population response (labelled as
[…] brown curve shows probability of observing a complex spike as a function of
the angle relative to each neuron’s CS-on. The black curve indicates cosine
fit of the peak population response. e, Gain-field encoding by a cluster of
Purkinje cells whose CS-off direction is to the right (equation (1)). f, Contribution
of single Purkinje cells to the population response. A change in direction
coincides with a shift in timing of the pause cells. Error bars are s.e.m.

Organizing the Purkinje cells into clusters where all the cells shared a
common complex-spike property resulted in simple spikes that 
encoded speed and direction in real time via a gain field. 

Together, our results suggest three principles of cerebellar function 
during control of saccadic eye movements. First, the cerebellum pre- 
dicts real-time motion not in the time course of individual Purkinje cell 
simple spikes, nor in the individual activities of the bursting or pausing 
populations, but in the combined activities of these two populations 
via the simple spikes that converge onto cells in the deep cerebellar 
nucleus. A similar population coding has been suggested during
smooth pursuit. Second, this population input to each nucleus neuron
encodes direction and speed via a gain field. Because a similar
encoding has been shown in the posterior parietal cortex during saccades,
as well as in the motor cortex during reaching, our observation
in the cerebellum suggests a common principle of encoding in
disparate regions of the motor system. Finally, the gain-field encoding 
was present if we assumed a specific anatomical organization: a cluster 
of Purkinje cells that projected onto a single nucleus neuron was 
composed of approximately equal numbers of bursting and pausing 
Purkinje cells, all sharing a common complex-spike property. Because 
the complex spikes of a Purkinje cell are due to input from the inferior 
olive, the gain-field encoding predicts that the oculomotor vermis is 
organized into clusters of Purkinje cells that share similar climbing 
fibre projections from the inferior olive. This, in turn, suggests that 
motor memories are anatomically clustered in the cerebellum by the 
errors that were experienced during movements.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

We analysed extracellular recordings from Purkinje cells of the oculomotor 
vermis in five rhesus monkeys (B, F, W, K, KO) while they made saccades to visual 
targets (Supplementary Information section 1). Each cell was well isolated for an
average of 3,000 saccades. Briefly, a scleral search coil was surgically attached to the
eye of each monkey, allowing measurement of eye position while the animal’s
head was restrained. After surgery, the monkeys were trained to make saccades to 
targets of varying amplitudes and directions. We identified Purkinje cell activity in 
OMV by their saccade-related change in the simple-spike response, as well as the 
presence of complex spikes. Neurophysiological data were sampled at 50 kHz. All 
experiments were performed in accordance with the Guide for the Care and Use of 
Laboratory Animals (1997) and exceeded the minimal requirements recom- 
mended by the Institute of Laboratory Animal Resources and the Association 
for Assessment and Accreditation of Laboratory Animal Care International. All 
animal procedures were approved by the local committee at the University of 
Washington. 

The CS-on direction for each cell was determined using the standard intrasaccadic
step paradigm, in which the target was displaced during the initial
saccade (Fig. 2). This error resulted in complex spikes during the period following 
the saccade, when the monkey observed the error. For every cell, we determined 
the CS-on direction as the error direction which elicited the largest probability of 
complex spikes during the 50-200 ms following the primary saccade. For n = 39 
cells we were able to maintain excellent isolation of the Purkinje cell throughout 
the experiment, allowing us to perform automated identification of complex spikes 
on every trial. This allowed us to compute the probability of complex spikes as a 
function of error direction. For the remaining n = 33 cells, the CS-on direction was 
determined via analysis of the initial 50 trials for each direction of error. CS-off
was defined as CS-on +180°. 
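
The CS-on and CS-off definitions above amount to a per-direction tally followed by an argmax. The sketch below assumes each trial is summarized as a pair (error direction in degrees, whether a complex spike occurred in the 50-200 ms window); that trial format and the function name `cs_on_off` are hypothetical.

```python
def cs_on_off(trials):
    """Return (cs_on, cs_off, prob) from (error_direction_deg, had_complex_spike) pairs."""
    counts = {}
    for direction, had_cs in trials:
        n, k = counts.get(direction, (0, 0))
        counts[direction] = (n + 1, k + (1 if had_cs else 0))
    # Probability of a complex spike for each error direction.
    prob = {d: k / n for d, (n, k) in counts.items()}
    cs_on = max(prob, key=prob.get)   # direction with the highest Pr(CS)
    cs_off = (cs_on + 180) % 360      # opposite direction, wrapped to [0, 360)
    return cs_on, cs_off, prob

# Toy data mimicking the example neuron in Fig. 2b (CS-on at -45 degrees).
toy_trials = ([(-45, True)] * 8 + [(-45, False)] * 2 +
              [(45, True)] * 3 + [(45, False)] * 7 +
              [(135, True)] * 1 + [(135, False)] * 9)
cs_on, cs_off, pr_cs = cs_on_off(toy_trials)
```

For the toy data this gives CS-on at −45° and CS-off at 135°, matching the directions quoted for the example cell.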

We computed firing rates by determining the inverse of the time between two
consecutive simple spikes and then convolved the resulting time series with a
normalized Gaussian kernel with a standard deviation of 2.5 ms, which is significantly
shorter than conventional kernels. This guarded against overestimating the
duration of a population response. We calculated the mean firing rate during the
saccade by taking the average of the firing rate from the start to the end of the
saccade. We determined the peak response of a Purkinje cell as the maximum
firing rate during the saccade period.
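
A minimal sketch of the inverse inter-spike-interval rate and the two summary measures follows. It assumes sorted, distinct spike times in milliseconds and 1-ms bins, and it omits the Gaussian smoothing step for brevity; the function names are illustrative only.

```python
def instantaneous_rate(spike_times_ms, n_bins):
    """Rate (spikes/s) per 1-ms bin as the inverse of the enclosing inter-spike interval."""
    rate = [0.0] * n_bins
    for t0, t1 in zip(spike_times_ms, spike_times_ms[1:]):
        r = 1000.0 / (t1 - t0)  # interval in ms converted to spikes per second
        for t in range(int(t0), min(int(t1), n_bins)):
            rate[t] = r
    return rate

def mean_rate(rate, start, end):
    """Average rate from saccade start (inclusive) to saccade end (exclusive)."""
    return sum(rate[start:end]) / (end - start)

def peak_rate(rate, start, end):
    """Maximum rate within the saccade period."""
    return max(rate[start:end])

# Toy spike train: the interval shortens from 10 ms to 5 ms near the end.
toy_rate = instantaneous_rate([0, 10, 20, 25], n_bins=30)
toy_mean = mean_rate(toy_rate, 10, 25)
toy_peak = peak_rate(toy_rate, 10, 25)
```

Bins outside the first and last spike are left at zero in this sketch; a real analysis would need an explicit rule for those edges.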

The response of an individual Purkinje cell is quite variable, with some neurons
showing a combination of bursting and pausing activity in the time period near 
saccade onset. Therefore, to categorize neurons exclusively as pausing or bursting, 
we compared the mean firing rate of each cell from the period 200-100 ms before 
saccade onset of all recorded trials to a 150-ms period centred on saccade peak 
speed. Neurons that reduced their rate during this extended period were classified 
as ‘pausing’, whereas neurons that significantly exceeded this rate were classified as 
‘bursting’. We tested this categorization statistically via a paired t-test with a cutoff 
of P = 0.05. Only two neurons (both bursting) did not pass this statistical test. We 
included these two neurons in the bursting population. 

To establish confidence intervals on the population responses, we performed a 
bootstrap analysis in which we randomly sampled 50 neurons from the available 
pool of 72 (with replacement), which simulated the approximate number of 
Purkinje cell inputs that project onto a nucleus neuron. Error bars show mean
± s.e.m. of 50 bootstrapped Purkinje cell populations. In cases where we show the
responses of the bursting/pausing populations separately, we report the mean ±
s.e.m. across neurons in the respective population without bootstrapping.
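
The bootstrap can be sketched as below. For simplicity each cell is reduced to a single number (for example its change from baseline at one time point), and the spread is reported as the standard deviation across bootstrapped populations; whether the published error bars use exactly this estimator is not stated in the text, so treat that choice as an assumption.

```python
import random
import statistics

def bootstrap_population(cell_values, n_cells=50, n_boot=50, seed=0):
    """Mean and spread of the summed response over bootstrapped populations."""
    rng = random.Random(seed)
    sums = []
    for _ in range(n_boot):
        # Sample n_cells cells from the pool with replacement.
        sample = rng.choices(cell_values, k=n_cells)
        sums.append(sum(sample))
    return statistics.mean(sums), statistics.stdev(sums)

# Toy pool of 72 cells with randomly drawn responses (spikes/s, hypothetical).
toy_rng = random.Random(1)
toy_cells = [toy_rng.gauss(5.0, 20.0) for _ in range(72)]
boot_mean, boot_spread = bootstrap_population(toy_cells)
```

Applying the same resampling at every millisecond of the smoothed traces would give a confidence band around the population response.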

To compute the population response of a cluster of 50 Purkinje cells, we first 
convolved the simple spikes of each Purkinje cell with a kernel (normalized 
Gaussian of 2.5 ms s.d.), approximately representing the time course of postsynaptic
inhibition induced by the simple-spike train. We then computed the
change from baseline for each cell, and finally the sum of changes across the
population, resulting in a population response that had units of spikes s⁻¹, computed
at each millisecond of time.
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increasing saccade amplitude (positive slope) and some neurons decreased 
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statistics from n = 39 Purkinje cells in which the probability of complex spikes 
was quantified as a function of direction of the error vector and direction of 
the saccade that preceded that error. Probability of complex spikes depended on 
direction of error, not direction of saccade. 
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Extended Data Figure 4 | Mean and peak/trough firing rate of the burst and 
pause cells were poorly modulated by saccade direction. a, Maximum, 
minimum and mean firing rates averaged across burst or pause cells with 
respect to saccade direction, relative to CS-on direction of each cell. b, Mean 
firing rates of the burst and pause cells, as measured across all saccades, were 
not significantly different for saccades in the CS-on versus CS-off direction 
(burst P > 0.10, pause P > 0.05). c, Mean firing rates of the burst and pause 
cells as a function of saccade speed, for saccades in the CS-on versus CS-off 
direction. Saccade speed modulated the mean firing rates of the burst cells, 
but there was no significant interaction between saccade direction and speed
(P > 0.6), nor a significant effect of saccade direction (P > 0.7). d, Peak
(maximum) firing rates of the burst cells and the minimum firing rate of the
pause cells as a function of saccade speed, for saccades in the CS-off and CS-on
directions. We asked whether the maximum response of the burst cells or 
the minimum response of the pause cells was significantly modulated by 
direction. Separate repeated measures ANOVAs showed that for the burst cells, 
peak activity increased as a function of saccade peak speed (P < 0.001), but this 
relationship was unaffected by saccade direction (P > 0.4). For the pause 
cells, the response was not affected by saccade speed (P > 0.6), and this 
relationship was not modulated by saccade direction (P > 0.4). We found that 
saccade direction did not significantly alter the encoding of peak speed in 
either the mean or minimum/maximum activity of Purkinje cells. Error bars in 
all panels represent s.e.m. across neurons. 
Extended Data Figure 5 | A population of Purkinje cells, organized by their
complex spike properties, predicted the real-time speed of the eye better
than activity of individual cells. a, We used equation (S2) (see Supplementary
Information for details) and the measured population response s(t) of
Purkinje cells to predict the real-time speed of the eye |ẋ(t + Δ)|. The plot
shows the predicted speed for saccades of 400, 525 and 650° s⁻¹. The predicted
speed led the actual speed by 19 ms. MSE is the mean squared error between the
predicted and actual eye trajectory at the optimal value of Δ. b, The result of
fitting equation (S2) (see Supplementary Information) to the response of 
individual neurons. c, The result of fitting equation (S2) (Supplementary 
Information) to the discharge of a population composed exclusively of burst 
cells. d, The result of fitting equation (S2) (Supplementary Information) to the 
discharge of a population composed exclusively of pause cells. 
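The search for the optimal lag mentioned in panel a can be sketched as follows. This is only an illustration, assuming both speed traces are sampled on the same regular grid; the function name `best_lag` and the toy traces are hypothetical, and the sketch does not reproduce equation (S2) itself.

```python
def best_lag(predicted, actual, max_lag):
    """Return (lag, mse) minimising the mean squared error between
    predicted[t] and actual[t + lag], for lags 0..max_lag (in samples)."""
    best = None
    for lag in range(max_lag + 1):
        n = min(len(predicted), len(actual) - lag)
        if n <= 0:
            break
        mse = sum((predicted[t] - actual[t + lag]) ** 2 for t in range(n)) / n
        if best is None or mse < best[1]:
            best = (lag, mse)
    return best


# Toy triangular speed profile (hypothetical values, one sample per ms).
actual = ([0.0] * 30
          + [float(v) for v in range(0, 50)]
          + [float(v) for v in range(50, 0, -1)]
          + [0.0] * 30)
# A prediction that leads the actual trace by 19 samples.
predicted = actual[19:] + [0.0] * 19

lag, mse = best_lag(predicted, actual, max_lag=40)
```

With these toy traces the search recovers the 19-sample lead that was built into the prediction.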
Extended Data Figure 6 | Change in saccade direction was associated with a
change in the timing of the reduction of discharge in the pause cells (that is,
pause onset) (see Fig. 4f). a, Timing of pause onset with respect to saccade
onset for saccades of various speeds and directions. We computed the pause
onset as the time when the neuron’s response reached 20% of its minimum
response. Positive numbers indicate that the pause onset occurred before
saccade onset. b, Within-neuron measure of pause onset for saccade in
direction CS-on, minus onset from saccades in direction CS-off.
Negative numbers indicate that the pause onset occurred earlier for
saccades in the CS-on direction. Error bars in all panels indicate s.e.m. across
neurons.
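The pause-onset criterion in panel a can be illustrated with a short sketch. It assumes the response is expressed as a change from baseline firing (so the pause trough is negative) and that times are in milliseconds relative to saccade onset; the function name `pause_onset` and the example trace are hypothetical.

```python
def pause_onset(times_ms, response, fraction=0.2):
    """First time at which a baseline-subtracted response falls to
    `fraction` of its minimum (most negative) value, or None if the
    response never drops below baseline."""
    trough = min(response)
    if trough >= 0:
        return None
    threshold = fraction * trough
    for t, r in zip(times_ms, response):
        if r <= threshold:
            return t
    return None


# Hypothetical pause-cell response (spikes per second relative to baseline).
times_ms = list(range(-50, 60, 10))
response = [0, 0, -2, -5, -12, -30, -50, -40, -20, -5, 0]

onset = pause_onset(times_ms, response)
lead_ms = -onset  # positive when the pause begins before saccade onset
```

Here the trough is -50, so the threshold is -10 and the first sample at or below it occurs 10 ms before saccade onset.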
Extended Data Figure 7 | Complex-spike-dependent organization of the
Purkinje cells. a, Hypothesized anatomical organization of the oculomotor 
vermis (OMV). Bursting and pausing Purkinje cells are organized into clusters, 
with the cells in each cluster sharing a common complex-spike direction. 
Neurons on the right side of the OMV project to right cFN neurons and have
CS-off directions to the right. b, Distribution of the CS-off directions from 
recorded neurons in chamber coordinates. Vertical dotted line shows the line 
that best separates rightwards CS-off direction neurons (blue) from leftwards 
CS-off direction neurons (red). c, Probability of having a rightwards (blue),
up/down (green), or leftwards CS-off direction as a function of chamber
coordinates. Purkinje cells with CS-off to the left were more probable on the left 
side of the cerebellum. Purkinje cells with CS-off to the right were more 
probable on the right side of the cerebellum. d, Pause (red) and burst (blue) 
Purkinje cells were equally likely at all recorded locations. 
Extended Data Figure 8 | The population response was sensitive to the
fraction of pause and burst cells that composed a cluster of Purkinje cells. In 
our data set, 54% of the population was composed of burst cells. We computed 
the population response under the assumption that the membership of a 
cluster was 54% burst cells. Here, we tested how sensitive the population 
response was to this membership ratio. The vertical lines indicate saccade onset 
and offset for all saccades pooled across direction and speed. As the percentage 
of burst cells in the cluster becomes larger than 70%, or smaller than 50%, 
the population response no longer returns to baseline at saccade offset. 
Extended Data Figure 9 | Gain-field encoding of saccade kinematics in the
population response of the Purkinje cells disappeared if the Purkinje
cells were organized by their simple-spike activity. a, In this analysis we
assumed that a collection of 50 Purkinje cells projected onto a single cFN
neuron, with the property that all the Purkinje cells shared a similar
simple-spike preferred direction. Therefore, the cluster was organized based on
the simple-spike properties of the Purkinje cells, not their complex-spike
properties. b, The population response for saccades made in the direction for
which each Purkinje cell showed the largest mean firing rate (simple spikes),
for various saccade peak speeds. The peak population response was not
modulated with saccade speed. Error bars are bootstrap-estimated s.e.m.
c, The population response for saccades made in the direction of maximal
modulation. For burst cells, this was the direction for which the Purkinje cell 
showed the largest mean firing rate, whereas for pause cells, this was the 
direction associated with the minimum activity (largest pause). The peak 
population response was not modulated with saccade speed when clusters were 
organized based on the direction of maximal simple-spike modulation. 
η-Secretase processing of APP inhibits neuronal
activity in the hippocampus

Alzheimer disease (AD) is characterized by the accumulation of
amyloid plaques, which are predominantly composed of amyloid-β
peptide’. Two principal physiological pathways either prevent or
promote amyloid-β generation from its precursor, β-amyloid
precursor protein (APP), in a competitive manner’. Although APP
processing has been studied in great detail, unknown proteolytic
events seem to hinder stoichiometric analyses of APP metabolism
in vivo’. Here we describe a new physiological APP processing
pathway, which generates proteolytic fragments capable of
inhibiting neuronal activity within the hippocampus. We identify higher
molecular mass carboxy-terminal fragments (CTFs) of APP,
termed CTF-η, in addition to the long-known CTF-α and CTF-β
fragments generated by the α- and β-secretases ADAM10 (a
disintegrin and metalloproteinase 10) and BACE1 (β-site APP cleaving
enzyme 1), respectively. CTF-η generation is mediated in part by
membrane-bound matrix metalloproteinases such as MT5-MMP,
referred to as η-secretase activity. η-Secretase cleavage occurs
primarily at amino acids 504-505 of APP695, releasing a truncated
ectodomain. After shedding of this ectodomain, CTF-η is further
processed by ADAM10 and BACE1 to release long and short Aη
peptides (termed Aη-α and Aη-β). CTFs produced by η-secretase
are enriched in dystrophic neurites in an AD mouse model and in
human AD brains. Genetic and pharmacological inhibition of
BACE1 activity results in robust accumulation of CTF-η and
Aη-α. In mice treated with a potent BACE1 inhibitor, hippocampal
long-term potentiation was reduced. Notably, when recombinant
or synthetic Aη-α was applied on hippocampal slices ex vivo,
long-term potentiation was lowered. Furthermore, in vivo single-cell
two-photon calcium imaging showed that hippocampal neuronal
activity was attenuated by Aη-α. These findings not only
demonstrate a major functionally relevant APP processing pathway,
but may also indicate potential translational relevance for
therapeutic strategies targeting APP processing.

To identify new proteolytic pathways of APP, we searched for CTFs
other than those giving rise to P3 fragment (CTF-α) or amyloid-β
(CTF-β)*°. A new CTF with an approximate molecular mass of
30 kilodaltons (kDa) was revealed, which was recognized by an antibody
to the C terminus of APP (Y188) and was absent in the brains of
APP-knockout mice® (Fig. 1a and Supplementary Table 1). The molecular
mass of the novel CTF suggests an additional physiological cleavage of
APP amino-terminal to the known cleavage sites of β- and α-secretases,
which we named accordingly η-cleavage of APP (Extended Data
Fig. 1). In the soluble fraction, we detected the N-terminal cleavage
product (sAPP-η; Extended Data Fig. 2), with a molecular mass of
approximately 80 kDa that distinguishes it from alternative
N-terminal APP fragments described previously’. In addition, we
observed lower molecular mass soluble peptides (Aη), which
presumably derived from processing of CTF-η by BACE1 (Aη-β) or
ADAM10 (Aη-α), or alternatively from sAPP-α/β cleavage
(Fig. 1b). Aη was identified in the soluble fraction of mouse brains
as several closely spaced peptides by antibody M3.2 (Fig. 1b),
demonstrating that some of these fragments contain the N-terminal part
of the amyloid-β domain and probably end at the α-secretase cleavage
site. Aη fragments were further validated by antibody 9478D directed
against an epitope N-terminal to the amyloid-β domain (Fig. 1b).
Consistent with η-secretase cleavage of APP in wild-type mice, we
observed increased CTF-η and Aη production in brain homogenates
of APPPS1-21 transgenic mice’ (Fig. 1c, d). Furthermore, antibody
192swe selectively identified the Aη-β species ending at the BACE1
cleavage site (Fig. 1d). Consistent with increased BACE1 cleavage of
Swedish mutant APP, only minor amounts of Aη-α were detected in
this mouse model with antibody 2D8 (Fig. 1d). Physiological
η-secretase processing was further confirmed in cerebrospinal fluid (CSF)
from humans with and without the Swedish mutation (APPswe)
(Fig. 1e). Fivefold higher Aη than amyloid-β levels were observed
in human CSF (5.33 ± 1.39 times (mean ± s.d.) higher Aη than
amyloid-β estimated by 2D8 blot signals; n = 7, P < 0.0001, Student’s
t-test). Using antibody 192swe, Aη-β was selectively detected in the
CSF of patients with the Swedish mutation, whereas antibodies 2E9
and 2D8 detected Aη-α in all analysed samples (Fig. 1e). Moreover,
while these peptides are generated by η-secretase cleavage N-terminal
to the amyloid-β domain, they do not reach the γ-secretase site (see
mass spectrometric analysis in Fig. 1f), demonstrating that they are
different to the previously described N-terminally extended
amyloid-β variants’*’”. Membrane-bound matrix metalloproteinases such as
MT1-MMP and MT5-MMP (also known as MMP14 and MMP24,
respectively) were shown to cleave APP in vitro at a site consistent
with the molecular mass of η-secretase processing products'*"*. We
therefore produced a neo-epitope-specific antibody (10A8; Extended
Data Fig. 2) to identify the η-secretase cleavage site. Antibody 10A8
detected a protein corresponding to sAPP-η with an approximate
molecular mass of 80 kDa in mouse brain lysates, which was absent
in the APP-knockout mouse brain (Extended Data Fig. 2). Thus,
η-secretase cleavage of APP may occur in vivo at least in part at amino
acids 504-505 of APP695. To determine the N- and C-terminal
cleavage sites of Aη peptides, we performed immunoprecipitation with
antibodies 9476M, 9478D, 2D8 and 2E9 (Extended Data Fig. 3a, b).
Isolated Aη peptides were digested with three different proteases to
produce several overlapping peptides, and analysed by mass
spectrometry. We identified several peptides covering the entire sequence
between the cleavage site N504-M505 of APP695 starting with the
sequence MISEPRISY after the η-secretase cleavage site (Fig. 1f).
Mass spectrometry also supports the C-terminal cleavages at the β-
and α-secretase sites. After immunoprecipitation with 2D8,
fragments of the amyloid-β domain were also observed in smaller
amounts alongside the novel Aη peptides (Fig. 1f and Extended
Data Fig. 3c). As the η-secretase cleavage site at amino acid 505 of
APP is consistent with the previously described in vitro cleavage sites
of APP695 by MT1-MMP and MT5-MMP (refs 13, 14), we
investigated brains from MT5-MMP- and MT1-MMP-knockout mice’*’°
(Extended Data Fig. 4). Whereas MT1-MMP knockout had no
marked effect on Aη-α levels (Extended Data Fig. 4b), the generation
of Aη-α was reduced in brains from MT5-MMP-knockout mice
(Extended Data Fig. 4c). Furthermore, after MT5-MMP
overexpression in murine N2a cells, a selective increase in Aη-α peptide of
approximately 16 kDa was observed (data not shown). Thus,
MT5-MMP displays η-secretase activity in intact mouse brains, although
the contribution of other η-secretases must be considered.

Figure 1 | A novel APP proteolytic processing pathway. a, A 30-kDa APP
CTF-η is detected in the brains of adult and postnatal mice (P10, postnatal day
10). APP-FL, full-length APP; KO, knockout; WT, wild type. b, Aη was
detected in the soluble (sol.) fraction of adult and P10 mice by antibody M3.2
and 9478D (antibody 9478D may not be sensitive enough to detect the lower
Aη levels in adult brain). c, Higher levels of CTF-η are observed in APPPS1-21
mouse brains as compared to wild-type in the membrane (mem.) fraction.
Short and long exposures indicated. Background bands are indicated by
asterisks. d, Soluble extracts of APPPS1-21 mouse brains contained Aη species,
as detected by 2E9 antibody. Aη-βswe was selectively detected by antibody
192swe. e, Aη and amyloid-β (Aβ) were readily detectable in human CSF by
antibody 2D8. Antibody 2E9 allowed the detection of Aη in all samples,
whereas 192swe specifically detected Aη-βswe. f, Mass spectrometric analysis of
Aη. Peptide intensities were summed per amino acid residue and plotted in
relation to each other.

While investigating protease inhibitors capable of blocking
η-secretase, we observed that pharmacological BACE1 inhibition led
to a pronounced accumulation of the long Aη-α species in Chinese
hamster ovary (CHO) 7PA2 cells (Fig. 2a). This indicates that after
blockade of β-secretase activity, processing by α-secretase leads to
enhanced production of the long Aη-α species at the expense of the
shorter BACE1-generated Aη-β. Similarly, BACE1 inhibition also
led to an accumulation of endogenous CTF-η and enhanced
production of endogenous Aη-α in primary mouse hippocampal neurons
(Fig. 2b, c), as well as human neurons differentiated from embryonic
pluripotent stem cells (H9 cells'’; Fig. 2d-g). Furthermore, in
human neurons we not only detected a 65% increase (Fig. 2g) in the
slightly longer Aη-α species after BACE inhibition, but also a
concomitant decrease in Aη-β peptides (Fig. 2e, top). Importantly,
η-secretase processing significantly exceeded amyloidogenic
processing (9.5 ± 1.87 times Aη compared to amyloid-β estimated for
human neurons; n = 8, P < 0.001, Student’s t-test). Pharmacological
intervention with a BACE1 inhibitor in vivo caused a clear and
time-dependent increase in CTF-η and Aη-α levels in APPV717I transgenic
mice’® (Fig. 2h), which was fully reversible within 24 h after
administration. In agreement, increased CTF-η and Aη-α levels were also
observed in a BACE1-knockout mouse”? in vivo (Fig. 2i).
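The fold-difference statistics quoted above (mean ± s.d. of the Aη to amyloid-β ratio, followed by a Student’s t-test) could be computed along the following lines. This is a sketch under stated assumptions: the text does not say which comparison the t-test makes, so a one-sample test of the per-sample ratio against 1 is assumed here, and the band intensities are invented for illustration. The function name `ratio_summary` is hypothetical.

```python
import math
import statistics


def ratio_summary(a_eta, a_beta, null_ratio=1.0):
    """Per-sample A-eta / amyloid-beta ratios summarised as
    (mean, sample s.d., one-sample t statistic against null_ratio)."""
    ratios = [x / y for x, y in zip(a_eta, a_beta)]
    mean = statistics.mean(ratios)
    sd = statistics.stdev(ratios)
    t_stat = (mean - null_ratio) / (sd / math.sqrt(len(ratios)))
    return mean, sd, t_stat


# Hypothetical blot intensities (arbitrary units), one pair per sample.
a_eta_signal = [8.0, 10.0, 12.0]
a_beta_signal = [1.0, 1.0, 1.0]

mean_ratio, sd_ratio, t_stat = ratio_summary(a_eta_signal, a_beta_signal)
```

The t statistic has n - 1 degrees of freedom; converting it to a P value requires a t-distribution table or a statistics library.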

Figure 2 | Inhibition of BACE1 results in increased levels of CTF-η and Aη-α.
a, Conditioned media from 7PA2 cells treated with or without a BACE1
inhibitor (BI) were co-migrated with synthetic Aη-β and Aη-α peptides and
immunoblotted with 2D8 and 2E9 antibodies. b-f, After BACE inhibition in
mouse hippocampal neurons and human neurons, a reduction of sAPP-β was
accompanied by a strong increase in endogenous Aη-α and CTF-η levels.
Lys., lysate; sup., supernatant. g, Quantification of intensities for 2D8 signals
in e (n = 8; ***P < 0.001, Student’s t-test). h, BACE1 inhibition in vivo
resulted in enhanced production of Aη-α species in APPV717I mice.
Background bands are indicated by asterisks. i, Western blot analysis of soluble
extracts of P10 Bace1−/− mouse brains revealed a marked increase in Aη-α
peptides as compared to controls.

To identify a potential contribution of η-secretase processing to AD
pathology, immunohistochemical analyses were performed with
brains derived from six-month-old APPPS1-21 (ref. 20) mice
(Extended Data Fig. 5a). This revealed co-labelling of antibody Y188
with antibody 2E9 in dystrophic neurites. No signal for Aη peptides
was obtained in plaque cores in which aggregated amyloid-β was
detected by 6E10 staining (Extended Data Fig. 5a). Similar data were
obtained in human AD brains (Extended Data Fig. 6). To verify
accumulation of CTF-η in dystrophic neurites, we used laser capture
microdissection (LCM). Western blot analysis revealed not only
CTF-β and CTF-α, but also CTF-η within the halo, but not within
the plaque core area or regions devoid of plaques (Extended Data
Fig. 5b). As expected, amyloid-β was observed within the plaque core
as well as in the surrounding halo (Extended Data Fig. 5b).

Since the cleavage products of η-secretase APP processing
accumulate after BACE1 inhibition and are enriched in dystrophic
neurites, we examined whether soluble Aη peptides interfere with neuronal
function, similar to soluble amyloid-β oligomers’. Long-term
potentiation (LTP) is considered a synaptic correlate of memory, and is
widely used as a model for investigating the neurotoxic effects of
amyloid-β oligomers on synaptic function”’”’. A single oral dose of
the BACE1 inhibitor SCH1682496 increased the CTF-η levels, and
almost doubled the Aη-α level in soluble brain extracts prepared from
animals 3 h after treatment (Extended Data Fig. 7). This was
accompanied by a significant reduction of hippocampal LTP (Fig. 3a, b).
These findings may suggest an involvement of Aη-α in LTP deficit
under acute blockade of β-secretase activity. To validate the potential
effects of Aη peptides on synaptic transmission and plasticity directly,
we expressed Aη-β and Aη-α in CHO cells (Fig. 3c). Concentrated
conditioned media was further enriched for Aη by size-exclusion
chromatography (SEC) and applied to hippocampal slices before
LTP induction in CA1 pyramidal neurons. Neither Aη-β nor Aη-α
influenced the baseline synaptic transmission (Extended Data Fig. 8).
Comparison of LTP 60 min after its induction in the presence of Aη-β or
Aη-α with control conditions (Fig. 3d-g) revealed that Aη-α lowered the
LTP to a degree comparable to synthetic amyloid-β dimers” (Extended
Data Fig. 9a, b), while truncated Aη-β had no effect (Fig. 3e, g). In
support of this observation, synthetic Aη-α reduced LTP to a similar
extent at a concentration of 100 nM (Extended Data Fig. 9c, d). To
examine the direct effects of Aη peptides on neuronal activity in vivo,
we used two-photon calcium imaging at single-cell resolution”.
Figure 4a-d illustrates results from experiments in which the activity
of neurons in the CA1 pyramidal cell layer was monitored before and
after superfusion of the exposed hippocampus with Aη peptides or the
respective control peptides. Aη-α strongly suppressed the activity of
hippocampal neurons in vivo, an effect not observed with Aη-β or the
control peptide (Fig. 4a-d and Extended Data Fig. 10). By using local
application of synthetic Aη-α to hippocampal neurons, we demonstrate
that the inhibitory effect of Aη-α on neurons was readily reversible after
washout (Fig. 4e). A summary of the results from all experiments is
shown in Fig. 4f, g.

Figure 3 | Aη-α impairs hippocampal LTP. a, Pharmacological BACE1
inhibition with SCH1682496 lowers hippocampal LTP. DMSO,
dimethylsulfoxide; fEPSP, field excitatory postsynaptic potential; HFS, high-frequency
stimulation. b, Representative fEPSPs recorded in CA1 area before and 45 min
after tetanization of Schaffer collaterals (top), with summary plot of the
effects of the inhibitor (SCH1682496) and vehicle (DMSO) on fEPSP slopes in
all examined groups in a (n = 9). c, Soluble Aη-α and Aη-β peptides were
expressed in CHO cells and blotted with 2D8, 2E9 and 9478D antibodies.
d-f, Aη-α (n = 9) (d), but not Aη-β (n = 7) (e), conditioned media
from untransfected cells (CHO; n = 13) or artificial cerebrospinal fluid (ACSF)
(control; n = 15) (f) significantly inhibited LTP. g, Summary graph of LTP
magnitudes calculated 45-60 min after high-frequency stimulation from
graphs in d-f with statistical analysis (*P < 0.05; one-way analysis of variance
(ANOVA) and post hoc Bonferroni test); error bars represent s.e.m. For each
condition, sample fEPSP traces pre-LTP (black) and 45-60 min post-LTP
(grey) induction are shown (top).
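An LTP magnitude of the kind summarised in Fig. 3g can be sketched as the mean fEPSP slope 45-60 min after high-frequency stimulation, expressed as a percentage of the pre-stimulation baseline. The baseline window, the sampling interval and the slope values below are assumptions made for illustration, and `ltp_magnitude` is a hypothetical helper rather than the analysis code used for the figure.

```python
def ltp_magnitude(times_min, slopes, baseline=(-15, 0), window=(45, 60)):
    """Mean fEPSP slope within `window` as a percentage of the mean slope
    within `baseline`. Times are minutes relative to HFS at t = 0."""
    base = [s for t, s in zip(times_min, slopes)
            if baseline[0] <= t < baseline[1]]
    post = [s for t, s in zip(times_min, slopes)
            if window[0] <= t <= window[1]]
    if not base or not post:
        raise ValueError("no samples in the baseline or post-HFS window")
    return 100.0 * (sum(post) / len(post)) / (sum(base) / len(base))


# Hypothetical recording: one slope measurement every 5 min.
times_min = list(range(-15, 61, 5))
slopes = [1.0 if t < 0 else 1.5 for t in times_min]

magnitude = ltp_magnitude(times_min, slopes)
```

A value of 100 would mean no potentiation relative to baseline; the toy trace above gives 150.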

We have identified a new APP processing pathway that exceeds
amyloidogenic processing. Similar to amyloid-β production”, the
alternative proteolytic processing pathway occurs under physiological
conditions but may be altered during AD pathogenesis. Accumulation
of η-secretase” and CTF-η within dystrophic neurites in close vicinity to
neuritic plaques may also support its potential contribution to AD
pathology. However, all APP and presenilin-associated familial AD mutations
affect amyloid-β production and aggregation (reviewed in ref. 26),
whereas the Icelandic mutation APPA673T prevents AD and dementia
by moderately reducing amyloid-β production”. Indeed, the Swedish
mutation decreased Aη-α by strongly enhancing BACE1-mediated
APP processing. However, η-secretase accumulation” and its activity
near amyloid plaques may suggest that η-secretase stimulation by
amyloid-β could be a downstream effector within the amyloid cascade.
Although Aη may be involved in the modulation of neuronal activity
and synaptic plasticity, the differential bioactivity of recombinant Aη-α
and Aη-β is currently unclear. One may speculate that the longer Aη-α
peptide is more stable owing to unknown post-translational
modifications. This would be consistent with our observation that, in contrast to
cell-produced Aη-β, 100 nM of synthetic Aη-β inhibits LTP (data not
shown). Finally, it is important to note that low-n amyloid-β oligomer
preparations from 7PA2 supernatants contain considerable amounts of
Aη (data not shown). Thus, the previously observed inhibition of LTP
with such fractions may also be attributed to the presence of Aη. Our
findings may also be considered in the context of continuing clinical trials
with BACE1 inhibitors. Together with the identification of numerous
brain-specific BACE1 substrates”, our data indicate that therapeutic
inhibition of BACE1 activity requires careful titration to prevent
unwanted adverse effects at several levels.

Figure 4 | Aη-α reduces neuronal activity in vivo.
a-d, Left, in vivo images of CA1 hippocampal
neurons labelled with the fluorescent calcium
indicator fluo-8AM. Middle and right, calcium
transients of five representative neurons, marked in
the corresponding left panels, before and during
bath-application of Aη or CHO conditioned
media. e, Calcium transients in hippocampal
neurons before, during and after local application
of synthetic Aη-α. f, g, Summary results of the
changes in the average rates of calcium transients (f),
and in the fractions of silent neurons (g); for statistics
see Supplementary Tables 2 and 3. Error bars
represent s.e.m.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS

Cell culture. Mycoplasma-free CHO and 7PA2 (ref. 31) cells (gift from E. Koo)
were grown in DMEM/F12 (Thermo Scientific) supplemented with 10% FCS
(Thermo Scientific) plus penicillin/streptomycin and non-essential amino acids
(PAA) in a humid incubator with 5% CO₂ at a temperature of 37 °C. For inhibitor
treatment, cell culture medium was replaced with fresh, pre-warmed serum-free
medium (OPTIMEM; Thermo Scientific) supplemented with inhibitors or DMSO
as vehicle control. Treatment was initiated when cells reached 90-100%
confluency and conditioned media were collected after 20-24 h. Supernatants were
cleared by centrifugation (10 min, 5,500g at 4 °C). To obtain cell lysates, cell
monolayers were washed once with ice-cold PBS and detached in 1 ml PBS using
a cell scraper. The cell suspension was pelleted by centrifugation (5 min, 1,000g at
4 °C) and lysed with RIPA buffer (20 mM sodium citrate, pH 6.4, 1 mM EDTA, 1%
Triton X-100 in ddH₂O) supplemented with Protease Inhibitor Cocktail
(Sigma-Aldrich). The protein concentration of lysates was determined using the Uptima
BC Assay Protein Quantitation kit (Interchim).

Primary cell culture. Hippocampal neurons were isolated from embryonic day 18
CD rats (Charles River) as described previously’. Dissociated neurons were plated
at 17,700 cells cm⁻² onto 6-cm dishes coated with poly-L-lysine (1 mg ml⁻¹;
Sigma) and cultured in Neurobasal medium supplemented with 2% B27 and
0.5 mM L-glutamine (all from Invitrogen). Hippocampal cultures were maintained
in a humidified 5% CO₂ incubator at 37 °C. For inhibitor treatment, DIV16 culture
medium was replaced with fresh, pre-equilibrated N2 medium (supplemented
with 20% of 4 days conditioned N2 medium from pure primary cultured
astrocytes) to which inhibitors or DMSO as vehicle control were added.

Generation and BACE1 inhibition of cerebral cortex neurons induced from
human embryonic stem cells. Cell lines in this study were H9 ES (WiCell
Research Institute). Pluripotent cells were cultured on mouse embryonic
fibroblasts (GlobalStem) in DMEM/F12 containing 20% (v/v) KSR, 100 µM
non-essential amino acids, 100 µM 2-mercaptoethanol, 50 U ml⁻¹ penicillin and
50 mg ml⁻¹ streptomycin (Life Technologies) and 10 ng ml⁻¹ FGF2. Directed
differentiation of human embryonic stem cells to cerebral cortex neurons was
carried out as described’”**. Human neurons (75 days after induction) were
treated with 141M β-secretase inhibitor LY2886721 (Selleck) dissolved in
DMSO (20 mM stock). Vehicle-only control assays were performed using
DMSO. The compound was applied twice at 48-h intervals. Extracellular media
was collected before drug addition and at subsequent 48-h intervals. Neurons were
collected after 4 days of treatment using 0.5 mM EDTA in PBS.

Transgenic mice, animal care and animal handling. Bace1−/− and APPPS1-21
mice were described before'®”” and were bred for this study in a BI6C57/J
background. All treatments were approved by the local committee for animal use and
were performed in accordance with state and federal regulations (license number
KVR-1I/221-TA116/09). Mice had access to pre-filtered sterile water and standard
mouse chow (Ssniff Ms-H, Ssniff Spezialdiäten GmbH, Soest, Germany) ad
libitum and were housed under a reversed day-night rhythm in IVC System Type II
L-cages (528 cm²) equipped with solid floors and a layer of bedding, in accordance
with local legislation on animal welfare.

BACE1 inhibitor treatment. Randomized APPV717I (ref. 18) mice were treated
with vehicle or with the inhibitor RO5508887 provided by Hoffmann-La Roche”.
The examiner was blinded to the treatment groups, which were decoded at the end of
the experiments. Three-month-old heterozygous female transgenic mice in mixed
FVB/N × C57Bl/6J background expressing human APPV717I (ref. 18) were used
for BACE1 inhibition studies. Gavage-mediated administration of BACE1
inhibitor (90 mg kg⁻¹, 14.06 ml kg⁻¹) or vehicle (14.06 ml kg⁻¹) was performed once*.
The BACE1 inhibitor was diluted in 5% ethanol (Merck) and 10% solutol
(Sigma-Aldrich) in sterile water (Baxter). Animals were sacrificed after 5, 8 and 24 h. Mice
were anaesthetized with 3.5 µl per gram body weight of a mixture of ketamine
(115 mg ml⁻¹ ketamine hydrochloride, Eurovet), xylazine 2% (23.32 mg ml⁻¹
xylazine hydrochloride, VMD Arendonk), atropine (0.50 mg ml⁻¹ atropine sulphate,
Sterop) and saline (8:5:2:5, v/v/v/v). For brain preparation, mice were flushed
trans-cardially with ice-cold saline (3.5 ml min⁻¹, 3 min). The brain was removed
from the cranium and dissected into left and right hemiforebrain, brainstem,
cerebellum and olfactory bulb. The brain structures were promptly immersed in
liquid nitrogen and stored at −80 °C. Different tissues (kidneys, spleen, liver,
stomach, gut, lungs and heart) were examined and checked for gross
abnormalities. No obvious abnormalities were observed in any of the treatment groups.
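The dosing figures above imply a fixed inhibitor concentration in the gavage solution and volumes that scale with body weight. The arithmetic can be sketched as follows; the 25 g body weight is a hypothetical example, and `dosing_plan` is an illustrative helper, not part of the published protocol.

```python
def dosing_plan(body_weight_g,
                dose_mg_per_kg=90.0,
                gavage_ml_per_kg=14.06,
                anaesthetic_ul_per_g=3.5):
    """Per-animal amounts derived from the per-kilogram and per-gram
    figures quoted in the Methods."""
    weight_kg = body_weight_g / 1000.0
    return {
        "inhibitor_mg": dose_mg_per_kg * weight_kg,
        "gavage_volume_ml": gavage_ml_per_kg * weight_kg,
        "solution_mg_per_ml": dose_mg_per_kg / gavage_ml_per_kg,
        "anaesthetic_ul": anaesthetic_ul_per_g * body_weight_g,
    }


# Hypothetical 25 g mouse.
plan = dosing_plan(25.0)
```

For this example animal the plan works out to 2.25 mg of inhibitor in about 0.35 ml, which corresponds to a solution of roughly 6.4 mg ml⁻¹, and 87.5 µl of the anaesthetic mixture.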
Preparation of protein extracts from brain. Brains were removed from the
cranium and dissected into left and right hemispheres. Brain tissue was
snap-frozen in liquid nitrogen and stored at −80 °C. Soluble proteins were extracted
with DEA buffer (50 mM NaCl, 0.2% diethylamine, pH 10, plus protease
inhibitor (P8340, Sigma-Aldrich))**; membrane proteins were extracted with
RIPA buffer (20 mM Tris-HCl, pH 7.5, 150 mM NaCl, 1 mM EDTA, 1 mM
EGTA, 1% NP-40, 1% sodium deoxycholate, 2.5 mM sodium pyrophosphate
plus protease inhibitor) or applying a membrane preparation protocol as
described before’’.

Protein analysis. Proteins were separated under denaturing conditions using 
discontinuous SDS-PAGE. Equal amounts of proteins denatured in Laemmli 
buffer were loaded onto the gel and 1041 of the SeeBlue Plus2 Prestained 
Standard (Invitrogen) served as molecular mass marker. Electrophoresis was per- 
formed in Tris-glycine buffer (25 mM Tris, 190 mM glycine in ddH20) using the 
Mini-PROTEAN system (BIORAD) on activated PVDF membranes. Low 
molecular mass proteins (<16kDa) were separated using precast gradient 
Tricine Protein Gels (10-20%, 1mm, Novex) in Tris-tricine buffer using the 
XCell SureLock Mini-Cell system (Novex). After separation by SDS-PAGE, pro- 
teins were transferred onto membranes using the tank/wet Mini Trans-Blot cell 
system (BIORAD). CTFs, Arn and amyloid-B were detected after transfer on 
Nitrocellulose membranes (Protran BA85; GE Healthcare), while other proteins 
were blotted on PVDF (Immobilon-P, Merck Millipore). As size markers for An 
synthetic peptides An-B (92 amino acids; 1-MISEPRISYGNDALMPSLTETKT 
TVELLPVNGEFSLDDLQPWHSFGADSVPANTENEVEPVDARPAADRGLTT 
RPGSGLTNIKTEEISEVKM-92) and the slightly longer Arn-o (108 amino acids; 
1-MISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADSV 
PANTENEVEPVDARPAADRGLTTRPGSGLTNIKTEEISEV KMDAEFRHDSG 
YEVHHQK- 108) were obtained from Peptide Speciality Laboratories. After com- 
pletion of the transfer and before blocking, proteins transferred to nitrocellulose 
membranes were additionally denatured by boiling the membrane in PBS 
(140 mM NaCl, 10mM Na ,HPO,, 1.75mM KH,PO,, 2.7mM KCl in ddH20, 
pH 7.4) for 5 min. After cooling to room temperature, nitrocellulose membranes 
as well as the PVDF membranes were blocked in I-Block solution (0.2% Tropix 
I-Block (Applied Biosystems), 0.1% Tween20 in PBS) for 1 h at room temperature 
or overnight at 4°C (with agitation). Transferred proteins were detected using 
immunodetection and enhanced chemiluminescence (ECL). First, blocked mem- 
branes were incubated with primary antibodies diluted in I-Block solution over- 
night at 4 °C (with agitation). After removal of the antibody, membranes were 
washed three times in TBS-T buffer (10 min each, at room temperature, with 
agitation; 140 mM NaCl, 2.68 mM KCl, 24.76 mM Tris, 0.3% Triton X-100 in 
ddH20, pH 7.6) and subsequently incubated with a horseradish-peroxidase- 
coupled secondary antibody (obtained from Promega or Santa Cruz). Secon- 
dary antibodies were diluted in I-Block solution and membranes were incubated 
for 1h at room temperature (with agitation) followed by three washes in TBS-T. 
For ECL detection, membranes were incubated with horseradish peroxidase sub- 
strate (ECL, GE Healthcare or ECL Plus, Thermo Scientific) for 1 min at room 
temperature and signals were captured with X-ray films (Super RX Medical 
X-Ray, Fujifilm), which were subsequently developed using an automated film 
developer (CAWOMAT 2000 IR, CAWO). Quantitation of protein was conducted
using ImageJ software. Ratios were obtained from signals on the same
film for Aη over amyloid-β. Quantitative data were analysed statistically by using
a two-tailed Student’s t-test.

Molecular cloning and transfection. For the expression of Aη-α and Aη-β in
CHO cells, the complementary DNAs of the respective fragments were amplified 
by PCR and subcloned into the pSecTag2A (Invitrogen) vector that features an 
N-terminal secretion signal. CHO cells were cultured in DMEM with 10% FCS 
and non-essential amino acids. Transfections were carried out using Lipofecta- 
mine 2000 (Invitrogen) according to the manufacturer’s instructions. 

Mass spectrometry analysis of samples. Beads with immunoprecipitated peptides
were resuspended in ddH2O and reduced with 10 mM dithiothreitol followed
by alkylation with 55 mM 2-chloroacetamide. Samples were divided into three
parts and digested with either 1 µg of trypsin (1.2 M urea, 0.4 M thiourea and
50mM ammonium bicarbonate), LysC (5M urea, 1.7M thiourea and 50mM 
ammonium bicarbonate) or chymotrypsin (0.3M urea, 0.1M thiourea and 
50 mM ammonium bicarbonate). To increase the sequence coverage further, par- 
tially cleaved peptides were generated by digesting for 5, 10, 20, 40, 60, 120, 180 and 
720 min. Samples from all time points of a respective protease were pooled and 
desalted on stage tips”. 

For liquid chromatography-tandem mass spectrometry (LC-MS/MS), peptides 
were separated on a Thermo Scientific EASY-nLC 1000 HPLC system (Thermo 
Fisher Scientific) and in-house packed columns (75 µm inner diameter, 20 cm
length, 1.9 µm C18 particles (Dr. Maisch GmbH)). The peptide mixture was loaded
in buffer A (0.5% formic acid) and separated with a gradient from 10% to 60%
buffer B (80% acetonitrile, 0.5% formic acid) within 40 min at 250 nl min−1 at a
column temperature of 50°C. A Quadrupole Orbitrap mass spectrometer” (Q 
Exactive, Thermo Fisher Scientific) was coupled to the HPLC system via a nano 
electrospray source. We used data-dependent acquisition with a survey scan range 
of 300 to 1,650 m/z, at a resolution of 60,000 m/z and selected up to five most 
abundant features with a charge state ≥2 for HCD fragmentation at a normalized
collision energy of 27 and a resolution of 15,000 at m/z 200. To limit repeated
sequencing, dynamic exclusion of sequenced peptides was set to 20 s. Thresholds
for ion injection time and ion target values were set to 20 ms and 3 X 10° for the 
survey scans, and 120 ms and 1 X 10° for the MS/MS scans. Data were acquired 
using the Xcalibur software (Thermo Scientific). 

Data analysis. To process mass spectrometry raw files, we used the MaxQuant 
software (v1.5.2.16). We used the Andromeda search engine, which is integrated
into MaxQuant, to search MS/MS spectra against APP695 and 247
common contaminating proteins. We set enzyme specificity to unspecific to
detect novel cleavage sites and set a peptide search length from 7 to 40 amino
acids. A false discovery rate cutoff of 1% was applied at the peptide level. For data
visualization we used R. Identified peptides were mapped to APP695. To display
quantitative evidence for overlapping peptides, intensities of identified peptides 
were summed and plotted per amino acid residue. The data of the individual 
immunoprecipitation and mass spectrometry analyses are depicted in Extended 
Data Fig. 3. 
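
The per-residue summation described above can be illustrated with a minimal Python sketch. This is not the authors' R code: the function name, the toy sequence (the first 12 residues of the Aη peptide given earlier) and the intensities are assumptions made for illustration, and the sketch assumes that each peptide is mapped by exact string match at every position where it occurs.

```python
def per_residue_intensity(protein_seq, peptides):
    """Sum peptide intensities over every residue each peptide covers.

    protein_seq: reference sequence as a string of one-letter codes.
    peptides: iterable of (peptide_sequence, intensity) pairs.
    Returns one summed intensity per residue of protein_seq.
    """
    coverage = [0.0] * len(protein_seq)
    for pep, intensity in peptides:
        start = protein_seq.find(pep)
        while start != -1:
            # Add this peptide's intensity to each residue it overlaps.
            for i in range(start, start + len(pep)):
                coverage[i] += intensity
            start = protein_seq.find(pep, start + 1)
    return coverage


# Hypothetical example: two overlapping peptides on a toy sequence.
toy_seq = "MISEPRISYGND"
toy_peptides = [("MISEPRI", 10.0), ("EPRISYG", 5.0)]
profile = per_residue_intensity(toy_seq, toy_peptides)
```

Residues covered by both peptides receive the sum of both intensities, which is what makes overlapping evidence visible when the profile is plotted along the sequence.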

Human CSF samples. Human CSF samples collected at the Department of 
Neurology Outpatient unit for neurodegenerative disease (KBFZ) of the 
University of Bonn were obtained by lumbar puncture at position L3, centrifuged 
and divided into small aliquots. For further analysis, samples were stored at −80 °C.
Turbid or blood-contaminated samples were excluded from analysis. Use of these 
samples for research purposes has been consented by all patients according to the 
ethical committee requirements of the University of Bonn Ethical committee and 
approval number 279/10. For the analysis of APPswe carriers with antibodies
192swe (ref. 44) lumbar CSF was obtained from family members. Tubes with
CSF were stored at −70 °C until analysis. The clinical diagnosis of probable AD
was based on NINCDS-ADRDA criteria*’. The diagnosis of AD was confirmed by 
neuropathological examination of the brain of one deceased mutation-carrier**”. 
This study was approved by research ethics committee at the Uppsala University 
Hospital (Dnr 048-2005). 

Neuropathology and immunohistochemistry. Use of brain samples for research 
purposes has been consented by all patients according to the ethical committee 
requirements of the University of Ulm Ethical committee and approval number 
54/08. Braak-NFT stages**, and CERAD™ scores for neuritic plaques were used to 
determine the degree of AD pathology according to the NIA-AA guidelines”. 
Consecutive paraffin sections from the human medial lobe were stained with 
22C11, 9476M and 9478D. Primary antibodies were detected with biotinylated 
anti-mouse and anti-rabbit IgG secondary antibodies and visualized with avidin- 
biotin-complex (ABC-Kit, Vector Laboratories) and diaminobenzidine-HCl 
(DAB). The sections were counterstained with haematoxylin. Positive and nega- 
tive controls were performed. 9476M and 9478D stainings were assessed in 10 
control and 10 AD patient cases. 

For double immunofluorescence analysis of APPPS1-21 brain sections,
6-month-old mice were killed by CO2 inhalation according to animal handling
laws. Brains were dissected and fixed with 4% paraformaldehyde in 0.1 M PBS,
pH 7.4 for 48 h. For immunohistochemistry, 25-µm-thick sagittal mouse brain
cryosections were treated with 10mM sodium citrate, pH 6 at 95 °C for 20 min, 
washed with 0.5% Triton X-100 in PBS, blocked with 5% goat serum (Invitrogen) 
and 0.5% Triton X-100 in PBS for 1 h and subsequently incubated overnight with 
primary antibodies diluted in blocking solution. Primary antibodies were used as 
listed in Supplementary Table 1. DAPI was used to counterstain nuclei. Signals 
were visualized using fluorescently labelled secondary antibodies. Confocal images 
were acquired using a Plan-Apochromat 25×/0.8 oil differential interference contrast
objective on a LSM 710 confocal microscope (Zeiss) in sequential scanning
mode using ZEN 2011 software package (black edition, Zeiss).

LCM of plaque enriched brain material. For laser capture microdissection of
plaque cores and halos, 10-, 11-, 14-, 16- and 24-month-old transgenic APPPS1-21
mice were used according to a previously published protocol with slight modifications.
Mice brains were dissected and immediately frozen on crushed dry ice.
Ten-micrometre-thick sagittal sections were cut using a Microm HM 560 cryostat
(Thermo Scientific), mounted on frame slides containing a 1.4 µm polyethylene
terephthalate membrane (Leica Microsystems) and subsequently stained or stored
at −80 °C for later usage. Staining was performed as follows: brain sections were
thawed briefly at room temperature, fixed with 75% ethanol for 1 min, stained with
0.05% Thioflavin-S for 5 min, washed with 75% ethanol and dried at room temperature.
LCM was performed on the same day using a laser dissection microscope
(Leica, LMD 7000) with the following settings: excitation wavelength 495 nm, laser
power 30, aperture 5, speed 6 and pulse frequency 119. From each animal, at least
800 plaque cores and halos, dissected from 12 brain sections, were cut using a 63×
magnification objective, collected in 0.5 ml caps (Leica Microsystems) and subsequently
pooled for protein analysis. Areas containing no plaques were cut using a
10× magnification objective and were used as controls. Protein lysates were prepared
essentially as described above using RIPA with 0.1% SDS.

Slice preparation and electrophysiological recordings applying Aη peptides in
vitro. Transverse hippocampal slices (350 µm) were prepared from P20-30 Swiss
mice following standard procedures. Slices were cut in ice-cold oxygenated
(95% O2, 5% CO2) solution containing 206 mM sucrose, 2.8 mM KCl, 1.25 mM
NaH2PO4, 2 mM MgSO4, 1 mM MgCl2, 1 mM CaCl2, 26 mM NaHCO3, 0.4 mM
sodium ascorbate and 10 mM glucose, pH 7.4. For recovery (1 h), slices were
incubated at 27 °C in oxygenated standard ACSF containing: 124 mM NaCl,
2.8 mM KCl, 1.25 mM NaH2PO4, 2 mM MgSO4, 3.6 mM CaCl2, 26 mM NaHCO3,
0.4 mM sodium ascorbate, 10 mM glucose (pH 7.4). Slices were inspected in a
chamber on an upright microscope (Slicescope, Scientifica Ltd) with infrared 
differential interference contrast illumination, and were perfused with the oxygenated
ACSF at 27 ± 1 °C. fEPSPs were recorded in the stratum radiatum of the
CA1 region using a glass electrode (filled with 1 M NaCl, 10 mM HEPES, pH 7.4)
and the stimuli (30% of maximal fEPSP) were delivered to the Schaffer collateral
pathway by a monopolar glass electrode (filled with ACSF). Electrodes were 
specifically placed just below the surface of the slice to maximize the exposure 
to circulating peptides. A minimum of 15-20 min stable baseline was first 
obtained in standard ACSF followed by another 15-20 min of bath application 
of ACSF containing SEC fractions (CHO, Aη-α or Aη-β; 1/15 dilution, interleaved
recordings) using re-circulation with a peristaltic pump at 2.5-3 ml min−1
while being continuously aerated with 95% oxygen. No alterations in fEPSP 
baseline responses were observed after incubation with the SEC fractions 
(Extended Data Fig. 8). In the continuous presence of ACSF/SEC solution, LTP 
was induced using a high-frequency stimulation protocol with two pulses of 
100 Hz for 1s with a 20s interval between pulses, and recorded for 1 h. Control 
recordings (no application of SEC fractions) were obtained in an interleaved 
fashion in which ACSF was re-circulated using an identical procedure. For 
LTP analysis, the first third of the fEPSP slope was calculated in baseline condition 
(15-20 min before induction of LTP) and compared to that after LTP induction 
(60 min after tetanization of Schaffer collaterals). The average baseline value was 
normalized to 100% and all values of the experiment were normalized to this 
baseline average (1-min bins). Experimental data were pooled per condition and 
presented as mean ± s.e.m. Data analysis was performed with the Clampfit software
(Molecular Devices). The test samples were blinded to the investigator and
uncoded at the end of the experiments. Statistical analysis was performed using 
GraphPad (Prism 6) with the last 15 min of the recordings compared to measure- 
ments of 15-20 min of baseline, using a two-tailed Student’s t-test for statistical 
analysis on two samples or one-way ANOVA and post hoc Bonferroni test for 
statistical analysis on three and more samples, with P < 0.05 taken as statistically 
significant. No power analysis was done to estimate sample size, and there was no 
randomization. 
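
The baseline normalization and pooling described above can be sketched in a few lines of Python. This is only an illustration and not the Clampfit or GraphPad workflow used in the study: the function names are hypothetical, the input is assumed to be a list of fEPSP slope values already averaged into 1-min bins, and the example numbers are invented.

```python
from math import sqrt
from statistics import mean, stdev


def normalize_to_baseline(slopes, n_baseline):
    """Express each binned fEPSP slope as a percentage of the mean baseline slope."""
    baseline = mean(slopes[:n_baseline])
    return [100.0 * s / baseline for s in slopes]


def ltp_magnitude(slopes, n_baseline, n_last):
    """Mean normalized slope over the last n_last bins of one recording."""
    return mean(normalize_to_baseline(slopes, n_baseline)[-n_last:])


def mean_sem(values):
    """Return (mean, s.e.m.) across recordings; needs at least two values."""
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical recording: four baseline bins followed by two potentiated bins.
example_slopes = [1.0, 1.0, 1.0, 1.0, 1.5, 1.5]
example_ltp = ltp_magnitude(example_slopes, n_baseline=4, n_last=2)
```

Because every value is divided by the same baseline mean, recordings with different absolute slopes become comparable before they are pooled per condition and summarized as mean ± s.e.m.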

Electrophysiological recordings of the effects of BACE1 inhibitor in vitro. 
BACE1 inhibitor (100 mg kg−1, single gavage) or vehicle-treated mice (12 weeks
old) were deeply anaesthetized with isoflurane (1% in O2) and decapitated with
brains rapidly extracted and placed for 5-6 min in ice-cold bubbled (95% O2, 5%
CO2) slicing solution (in mM): 75 sucrose, 85 NaCl, 2.5 KCl, 1.25 NaH2PO4,
25 NaHCO3, 0.5 CaCl2, 4 MgCl2, 25 glucose, pH 7.4. Coronal slices (400 µm)
containing the hippocampus were cut (VT1200S; Leica) and transferred into a 
warming chamber (35 °C) filled with bubbled solution of the same composition, 
except sucrose was omitted and NaCl increased to 125mM (30 min). This was 
followed by the transfer of slices into recording ACSF (in mM): 125 NaCl, 2.5 KCl, 
1.25 NaH2PO4, 25 NaHCO3, 2 CaCl2, 2 MgCl2, 25 glucose. Recordings of fEPSP
were made from the hippocampal CA1 area. A glass bipolar stimulating electrode 
was placed in the stratum radiatum of CA2-CA3 subfields to stimulate Schaffer 
collaterals with 0.2 ms current pulses at 0.033 Hz (A-360, WPI), with evoked 
responses recorded in the stratum radiatum of CA1 area. Incrementing current 
pulses (0.2 mA) were used for obtaining stimulus-response relationship graphs. 
Stable baseline and LTP recordings were made using one-half of the maximal 
stimulus intensities; LTP was induced by high-frequency stimulation of Schaffer 
collaterals with 10 trains of 10 pulses at 100 Hz applied, with 2-s inter-train 
intervals. Signals were filtered at 5 kHz, digitally sampled at 10 kHz and stored 
for offline analysis. The relative slope and peak amplitude of evoked fEPSPs were 
measured using FitMaster (HEKA Electronics). The groups of treated mice were 
blinded to the investigator and uncoded at the end of the experiments. A one-way 
ANOVA and post hoc Bonferroni test have been used for statistical analysis, with 
P < 0.05 taken as statistically significant. 
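
The high-frequency stimulation protocol above (10 trains of 10 pulses at 100 Hz) can be written out as a list of pulse onset times. The Python sketch below is illustrative only: the function name is hypothetical, and it assumes that the 2-s inter-train interval is measured from the start of one train to the start of the next, which the text does not specify.

```python
def train_pulse_times(n_trains=10, pulses_per_train=10,
                      pulse_hz=100.0, inter_train_s=2.0):
    """Return pulse onset times in seconds for a train-based tetanus.

    Assumption: inter_train_s is the start-to-start interval between trains.
    """
    period = 1.0 / pulse_hz  # 10 ms between pulses at 100 Hz
    return [train * inter_train_s + pulse * period
            for train in range(n_trains)
            for pulse in range(pulses_per_train)]


pulse_times = train_pulse_times()
```

With the default values the protocol delivers 100 pulses in total, and each train lasts 90 ms from its first to its last pulse onset.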

In vivo two-photon Ca2+ imaging. All experimental procedures were in compliance
with institutional animal welfare guidelines and were approved by the state
government of Bavaria, Germany. The animal preparation procedure was similar
to that described previously. In brief, C57Bl/6 mice (male or female, ~P40)
were anaesthetized with isoflurane (1-1.5%) and placed onto a warming plate 
(37-38 °C). The skin was removed and a custom-made recording chamber 
was glued to the exposed skull. A craniotomy (~1 mm) was made over the
hippocampus (2.5 mm posterior to bregma, 2.2 mm lateral to the midline) and a
small portion of the overlying cortical tissue was carefully removed by aspiration. 
The animal was placed under a microscope on a warm heating plate (37-38 °C) 
and kept anaesthetized with low-levels of isoflurane (~0.8%). Respiratory and 
pulse rates were continuously monitored. The recording chamber was perfused 
with warm normal Ringer’s solution containing 125 mM NaCl, 4.5 mM KCl,
26 mM NaHCO3, 1.25 mM NaH2PO4, 2 mM CaCl2, 1 mM MgCl2 and 20 mM
glucose (pH 7.4 when bubbled with 95% O2 and 5% CO2). The exposed CA1
region of the hippocampus was then stained with fluo-8AM (AAT Bioquest;
0.6 mM) using the multi-cell bolus loading technique.

In vivo imaging was performed with a custom-built two-photon microscope 
equipped with a Ti:sapphire laser system (Coherent; laser wavelength 925 nm), a 
resonant scanner and a Pockels cell for laser intensity modulation. Full-frame
images were acquired at 30 Hz using a water-immersion objective (Nikon; 40×,
0.8 numerical aperture). Data acquisition was controlled using custom-written 
software based on LabVIEW (National Instruments). Image analysis was per- 
formed off-line by using custom routines in LabVIEW and Igor Pro 
(Wavemetrics). Cellular regions of interest were drawn around individual somata, 
and then relative fluorescence change (ΔF/F) versus time traces were generated for
each region of interest. Ca2+ transients were identified as changes in ΔF/F that
were three times larger than the s.d. of the noise band. 
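
The transient criterion above (ΔF/F changes exceeding three times the s.d. of the noise band) can be sketched as follows. This is a minimal Python illustration, not the custom LabVIEW and Igor Pro routines used in the study. The function names are hypothetical, and the way the resting fluorescence F0 and the noise band are estimated (here, from a run of quiet frames at the start of the trace) is an assumption, since the text does not state it.

```python
from statistics import mean, stdev


def delta_f_over_f(trace, n_baseline):
    """Convert raw fluorescence to ΔF/F, taking F0 as the mean of the
    first n_baseline frames (an assumed baseline definition)."""
    f0 = mean(trace[:n_baseline])
    return [(f - f0) / f0 for f in trace]


def transient_onsets(dff, noise_sd, k=3.0):
    """Return frame indices where ΔF/F first rises above k × noise s.d."""
    threshold = k * noise_sd
    onsets = []
    above = False
    for i, value in enumerate(dff):
        if value > threshold and not above:
            onsets.append(i)  # upward threshold crossing
        above = value > threshold
    return onsets


# Hypothetical trace: four quiet frames, then one transient.
raw = [100.0, 101.0, 99.0, 100.0, 140.0, 150.0, 100.0, 100.0]
dff = delta_f_over_f(raw, n_baseline=4)
noise_sd = stdev(dff[:4])
onsets = transient_onsets(dff, noise_sd)
```

Only upward threshold crossings are recorded, so a transient that stays above threshold for several frames is counted once.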

To assess the effects of Aη peptides on neuronal activity in vivo, the peptides or
the respective controls (SEC fractions obtained from untransfected CHO cells or a
synthetic peptide (46 amino acids; 1-ADSVPANTENEVEPVDARPAADRGL
TTRPGSGLTNIKTEEISEVKM-46) of a middle part of Aη) were added to the
normal Ringer’s solution used for perfusion of the recording chamber (bath-application
technique; 45-60 min each wash-in). In a subset of experiments, synthetic
Aη-α (92 amino acids; 1-MISEPRISYGNDALMPSLTETKTTVELLPV
NGEFSLDDLQPWHSFGADSVPANTENEVEPVDARPAADRGLTTRPGSGLT 
NIKTEEISEVKM-92) was applied locally by gentle pressure injection through a 
glass pipette that was placed close to the neurons of interest (local application 
technique; 40 s each pressure injection). 

The samples were blinded to the investigator and uncoded at the end of the 
experiments. Statistical analysis was performed using SPSS. The statistical methods
used were the Student’s t-test and the Fisher’s exact test. P < 0.05 was considered
statistically significant.

Extended Data Figure 1 | Schematic presentation of the η-secretase processing pathway. Schematic representation of the η-secretase pathway (left) as
compared to the amyloidogenic pathway (right). Antibodies used in this study are indicated.


©2015 Macmillan Publishers Limited. All rights reserved 


[Figure panels: MMP cleavage site with the 10A8 epitope; western blot of soluble proteins from WT and APP-knockout (APPKO) brain probed with 10A8, showing sAPP-η.]


Extended Data Figure 2 | η-Secretase cleavage at Met505 of APP695. MMP
proteins can cleave human APP695 at the indicated position (arrow) between
amino acids N504 and M505 in the N-terminal domain. The epitope for
the neo-epitope-specific antibody 10A8 is indicated (grey line). sAPP-η was
specifically detected in diethylamine (DEA; 0.2% diethylamine in 50 mM NaCl,
pH 10) extracts of P10 wild-type mouse brain using antibody 10A8, but
was absent in APP-knockout brains. Of note, antibody 10A8 failed to detect
sAPP-α/β, confirming its selectivity for the η-cleavage site.

Extended Data Figure 3 | Mass spectrometry analysis of Aη peptides.
a, b, After removal of sAPP-α from conditioned media of CHO 7PA2 cells using
appropriate centricon filters, the flow-through was used to isolate Aη peptides
by immunoprecipitation. Synthetic peptides (1 ng per lane) were loaded to
indicate the respective sizes of Aη-α and Aη-β. Aη peptides were captured
with antibodies 9476M and 9478D directed against the putative N-terminal
epitopes (Supplementary Table 1), 2E9 against a middle domain of Aη
and 2D8 (which also immunoprecipitates amyloid-β). 2D8 detection
(a) revealed that all antibodies captured peptides positive for the N-terminal
part of the amyloid-β domain in a molecular mass range of synthetic Aη-α
between 12 and 16 kDa. The same samples analysed with 2E9 (b) confirmed the
presence of Aη in all samples, with the lowest levels when precipitated with
9476M. Note that unlike 2D8 antibody, 2E9 allows the additional detection
of Aη-β. (Asterisks denote IgG.) c, A heat map of peptides identified after
analytic proteolysis by mass spectrometry analysis, with 7PA2 supernatants
immunoprecipitated with 2D8, 2E9, 9476M and 9478D antibodies. Arrows
indicate peptides that start exactly with the amino acid sequence C-terminal of
the respective cleavage sites of the η-secretase, β-secretase or α-secretase site.
Chy, chymotrypsin; LysC, protease LysC; tryp, trypsin.

Extended Data Figure 4 | MT5-MMP displays η-secretase activity in brain.
a-c, MT1-MMP−/− and MT5-MMP−/− (also known as Mmp14−/− and
Mmp24−/−, respectively) mice were analysed for changes in η-secretase
activity. Membrane and soluble proteins from P10 mouse brains were analysed.
a, In RIPA lysates, no changes in APP and BACE1 levels were detected in
knockout brains. MT1- and MT5-MMP were selectively knocked out as shown
by the lack of signals in western blots. Calnexin served as a loading control.
Soluble Aη levels, detected by antibodies 9478D and M3.2 (Aη-α) were
unchanged in MT1-MMP−/− mouse brains (b), but reduced in MT5-MMP−/−
mouse brains (c). Total levels of secreted APP (22C11), sAPP-α or sAPP-β were
unchanged (b, c).

Extended Data Figure 5 | Accumulation of CTF-η in dystrophic neurites.
a, Immunohistological stainings of cortical sections of 6-month-old APPPS1-21
transgenic mice (n = 3) revealed 6E10-positive amyloid-β plaque cores
(encircled) surrounded by dystrophic neurites positive for 2E9 (white
arrowheads, top) and 9476M (white arrowheads, middle). Y188 (bottom panel)
staining co-localized with 2E9-positive signal (yellow arrowheads, bottom).
Nuclei were counterstained with DAPI. Scale bar, 10 µm. b, Accumulation of
CTF-η fragment in dystrophic neurites of 14-month and 24-month-old
APPPS1-21 mice. Western blot analysis of material obtained by LCM of
APPPS1-21 brain sections (n = 5) bearing thioflavin-S-positive amyloid-β
plaque core (P) and the surrounding amyloid-β plaque halo (H). As a control
(C), brain areas devoid of plaques were used. While amyloid-β was readily
detected by antibody 2D8 in lysates containing plaque-enriched material and
halo regions (fractions P and H; bottom), CTF-η was selectively detected in
the lysates prepared from the region enriched in dystrophic neurites (H; top),
but not detected in plaque or control regions (P or C). As expected,
CTF-β/α species are also enriched in dystrophic neurites (H).

Extended Data Figure 6 | Dystrophic neurites in AD brains are positive
for Aη-epitope antibodies. a-f, Immunohistochemistry with 22C11
(a, b), 9478D (c, d) and 9476M (e, f) antibodies in the human hippocampus
(CA1-subiculum region) of a control case (a, c, e) and an AD case (b, d, f).
Immuno-positive signals were observed with 22C11 (a), 9478D (c) and 9476M
(e) antibodies in the somata and neuropils of a normal and AD brain. In AD
brains, these antibodies decorate dystrophic neurites (b, d, f, denoted
by arrowheads). Scale bar, 30 µm. NP, neuritic plaque.

Extended Data Figure 7 | Increased Aη levels after acute treatment with a
BACE1 inhibitor. a, In membrane lysates of brains obtained from animals
treated with BACE1 inhibitor (BI, 100 mg kg−1 SCH1682496), an increase in
CTF-η was observed, which was paralleled by a strong reduction of CTF-β,
while CTF-α was unchanged. APP-FL and BACE1 signals remained
unchanged (asterisk indicates background band). Calnexin served as a loading
control. b, In the soluble fraction, BACE1 inhibition resulted in enhanced Aη-α
levels and reduced sAPP-β levels, indicating efficient BACE1 inhibition.
c, Production of Aη-α species, which was detected by antibody M3.2, revealed a
95.4% increase after BACE1 inhibition. n = 3; P < 0.01, Student’s t-test.

Extended Data Figure 8 | Aη-α and Aη-β derived from CHO cells did not
influence baseline activity at the hippocampal CA3-CA1 synapse. Soluble
Aη-α and Aη-β peptides were expressed in CHO cells and collected in
OPTIMEM medium. a, b, SEC fractions containing Aη were diluted (1:15) in
ACSF for the treatment of hippocampal slices and LTP measurements. Aη-α or
Aη-β SEC fractions were perfused over mouse hippocampal slices after
obtaining a 15-min stable baseline of a fEPSP at the CA3-CA1 synapse. The
baseline remained unchanged for another 15 min when slices were incubated
with CHO-cell-derived recombinant Aη-α (a) or Aη-β (b).

Extended Data Figure 9 | AβS26C dimers and synthetic Aη-α impair
hippocampal LTP. a, In line with previous findings, AβS26C cross-linked
dimers (containing cysteine instead of serine at residue 26; 100 nM final; JPT
Peptide Technologies; diluted in 25 ml re-circulating ACSF) reduced LTP as
compared to interleaved control LTP recordings in 25 ml re-circulating
ACSF. b, Illustrated is the average LTP magnitude (at 45-60 min after LTP
induction) normalized to pre-LTP baseline values (100%) in untreated and
treated conditions (***P < 0.001; Student’s t-test). c, Treatment with synthetic
Aη-α (100 nM final; Peptide Speciality Laboratories; diluted in 25 ml
re-circulating ACSF) reduced LTP as compared to interleaved control LTP
recordings in 25 ml re-circulating ACSF. d, Illustrated is the average LTP
magnitude (at 45-60 min post-LTP induction) normalized to pre-LTP baseline
values (100%) in treated and untreated conditions (*P < 0.05; Student’s t-test).

Inhibition of Gli1 mobilizes endogenous neural stem cells for remyelination

Jayshree Samanta1, Ethan M. Grund1, Hernandez M. Silva2, Juan J. Lafaille2, Gord Fishell1 & James L. Salzer1


Enhancing repair of myelin is an important but still elusive therapeutic
goal in many neurological disorders. In multiple sclerosis,
an inflammatory demyelinating disease, endogenous remyelination
does occur but is frequently insufficient to restore function.
Both parenchymal oligodendrocyte progenitor cells and endogenous
adult neural stem cells resident within the subventricular zone are
known sources of remyelinating cells. Here we characterize
the contribution to remyelination of a subset of adult neural stem
cells, identified by their expression of Gli1, a transcriptional
effector of the sonic hedgehog pathway. We show that these cells
are recruited from the subventricular zone to populate demyelinated
lesions in the forebrain but never enter healthy, white matter
tracts. Unexpectedly, recruitment of this pool of neural stem cells,
and their differentiation into oligodendrocytes, is significantly
enhanced by genetic or pharmacological inhibition of Gli1.
Importantly, complete inhibition of canonical hedgehog signalling
was ineffective, indicating that the role of Gli1 both in augmenting
hedgehog signalling and in retarding myelination is specialized.
Indeed, inhibition of Gli1 improves the functional outcome in
a relapsing/remitting model of experimental autoimmune encephalomyelitis
and is neuroprotective. Thus, endogenous neural
stem cells can be mobilized for the repair of demyelinated lesions
by inhibiting Gli1, identifying a new therapeutic avenue for the
treatment of demyelinating disorders.

Remyelination in the adult human and mouse brains is performed
by two cell types: oligodendrocyte progenitor cells (OPCs) and neural
stem cells (NSCs). OPCs, which are present in the parenchyma of
healthy brain as well as in, and around, multiple sclerosis lesions,
can be identified by their expression of the NG2 proteoglycan and
platelet-derived growth factor receptor alpha (PDGFR-α). They
respond locally to demyelination by generating oligodendrocytes,
although they do not migrate long distances during remyelination.
NSCs present in the subventricular zone (SVZ) express glial fibrillary
acidic protein (GFAP) and Nestin, and are normally quiescent. In
response to demyelination, cells in the adult SVZ can generate oligodendrocytes,
including, presumptively, in patients with multiple
sclerosis.

The signals that activate and recruit NSCs to lesion sites and pro- 
mote their local differentiation into oligodendrocytes remain poorly 
understood. A candidate to regulate NSCs is sonic hedgehog (Shh), an 
important morphogen during CNS development that is required for 
the generation of most oligodendrocytes during development’ and for 
the maintenance of stem cells in the adult SVZ’. Shh is therefore an 
attractive candidate to expand the pool of premyelinating cells avail- 
able for repair. Indeed, Shh levels have been reported to increase in 
remyelinating lesions”®. 

Canonical Shh signalling is mediated by interactions of the hedgehog
receptor patched (Ptc) with the G-protein-coupled transmembrane
co-receptor smoothened (Smo). Binding of Shh to Ptc relieves
its inhibition of Smo and thereby activates the Gli family of zinc-finger
transcription factors. Of the three Gli proteins, Gli1 is the only one
whose transcription is driven by Shh signalling and its expression is
therefore considered a sensitive readout of sustained, high-level activation
of this pathway.

In this study, we have examined remyelination by the Shh-responsive
(that is, Gli1+) pool of NSCs, which is concentrated in the ventral
SVZ and comprises ~25% of NSCs. To genetically fate-map Gli1+
NSCs, we crossed Gli1CreERT2/+ mice with the Rosa-CAG-EGFP
(RCE) reporter to generate Gli1CE/+ mice in which tamoxifen treatment
results in permanent expression of cytoplasmic green fluorescent
protein (GFP) in all Gli1-expressing cells and their
progeny; see Supplementary Table 1 for a summary of these and
other mouse lines used in this study. GFP-labelled cells correspond to
NSCs in the SVZ and a subset of astrocytes, but not OPCs or oligodendrocytes.
We then followed the fate of these GFP+ cells after
inducing demyelination in the mouse corpus callosum (CC) either
by (1) dietary cuprizone or (2) direct, stereotactic injection of the
detergent lysophosphatidyl-choline (LPC).

At 6 weeks of dietary cuprizone, corresponding to peak demyelination,
GFP-expressing cells were recruited to areas of demyelination. In contrast,
no labelled cells were observed in the CC of controls (Fig. 1a). At
2 weeks of remyelination, after removal of cuprizone from the diet,
GFP-expressing cells in the CC (17.3 ± 2.6 cells per section) differentiated
exclusively into glia, primarily oligodendroglia, namely PDGFR-α+
OPCs (9.8 ± 8.7%) and CC1+ oligodendrocytes (40.2 ± 15.1%), as
well as GFAP-expressing astrocytes (15.5 ± 4.1%, Fig. 1b, c); other
markers are shown in Extended Data Fig. 1a. Approximately 30% of
the GFP+ cells in the CC remained unspecified at this time. None of
the GFP-labelled cells expressed neuronal (NeuN) or microglial
(Iba1 and CD11b) markers (data not shown). Ten weeks after recovery
from cuprizone diet, the numbers of GFP+ cells in the demyelinated
CC increased from about 17 to 48 cells per section and
consisted of PDGFR-α+ OPCs (18.4 ± 1.8%), CC1+ oligodendrocytes
(58.3 ± 5.3%) and GFAP+ astrocytes (28.7 ± 11.6%) (Fig. 1c),
accounting for all the GFP+ cells in the CC. These results suggest that
Gli1+ NSCs continue to generate glial cells in the CC for a prolonged
period after demyelination. In addition to the cells within the CC,
GFP-labelled cells located outside the CC frequently increased with
demyelination; these correspond to a subset of protoplasmic astrocytes
that are responsive to Shh.

The newly generated, NSC-derived oligodendrocytes remyelinated
axons, as evidenced by GFP-labelled processes that flanked nodes of
Ranvier (Fig. 1e), overlapped with the paranodal marker Caspr
(Extended Data Fig. 1b), and co-expressed myelin oligodendrocyte
glycoprotein (MOG) and myelin basic protein (MBP), but not
the Schwann cell myelin protein P0 (Extended Data Fig. 1b).
Immunoelectron microscopy of the CC in Gli1CreERT2/+ mice crossed
to a membrane GFP reporter (Rosa-TdTomato-mGFP) demonstrated
GFP labelling in compact myelin sheaths surrounding axons (Fig. 1e),
corroborating that these NSCs form remyelinating oligodendrocytes.

For comparison, we also fate-mapped the entire pool of NSCs, in
healthy brains and after demyelination, using a NestinCE/+ driver line


1New York University Neuroscience Institute, Department of Neuroscience and Physiology, New York University School of Medicine, New York, New York 10016, USA. 2The Kimmel Center for Biology and 
Medicine of the Skirball Institute, New York University School of Medicine, New York, New York 10016, USA. 

Figure 1 | Gli1-expressing cells are recruited to, and generate myelinating
oligodendrocytes at sites of demyelination. a, Serial sections from the
brains of Gli1CE/+ mice on a control or cuprizone diet (at the peak of
demyelination) were stained with Black-Gold myelin (left) or immunostained
for GFP (right). Inset, top left: a coronal section of the forebrain; the box
highlights the area of CC analysed. Black arrow, lower left: a demyelinated
region of the CC. Labelled cells in the CC are restricted to the site of
demyelination (white arrow, lower right). N = 5 mice per group. Scale bar,
100 µm; inset, 40 µm. b, Comparison of Gli1CE/+ mice with NestinCE/+ mice
shows GFP+ cells are present in the CC of NestinCE/+ mice but not of Gli1CE/+
mice on control diet (top). During recovery from cuprizone, GFP+ cells appear


that preferentially labels the SVZ (Fig. 1b). We also observed GFP+
cells outside the SVZ, including some cells within the healthy CC
(Fig. 1b). The cells present in the healthy CC were largely astrocytes
(93.81 ± 3.34%); a small proportion were OPCs (1.81 ± 1.57%) and
oligodendrocytes (4.42 ± 0.29%) (Fig. 1b, d). Two weeks after recovery
from cuprizone, there was a striking increase in the numbers of labelled
cells in the CC, consistent with a recent report, associated with an
increase in the percentages of OPCs (28.49 ± 18.52%) and
oligodendrocytes (9.35 ± 3.25%) and a commensurate reduction in the
proportion of astrocytes (59.72 ± 9.54%) (Fig. 1b, d). These results do not
distinguish whether the oligodendrocytes present in the CC during
remyelination were generated primarily from NSCs, from pre-existing
precursors within the CC or both. They emphasize that, in contrast to
the broader Nestin+ pool of NSCs, Gli1 demarcates a distinct set of
NSCs in the SVZ that are recruited only upon demyelination and
preferentially fated to oligodendroglia.

To assess whether cells that enter demyelinated lesions in the CC were
actively responding to Shh, we analysed Gli1 expression using Gli1nLacZ/+
mice, which express nuclear LacZ from the Gli1 locus. In healthy
brains, Gli1 was expressed by cells in the cortex, basal forebrain and
ventral SVZ (Extended Data Fig. 2a, b). NSCs that co-expressed GFAP
and Gli1 were also present in the human SVZ (Extended Data Fig. 2c).
Outside the SVZ, labelled cells co-expressed GFAP (data not shown) but
not PDGFR-α (Extended Data Fig. 2b) and thus correspond to a subset
of mature astrocytes but not OPCs, as previously reported. No LacZ+
cells were present in the CC during or after recovery from cuprizone- or
LPC-mediated demyelination (data not shown), further indicating that
Shh-responsive cells do not arise from within the callosum. Thus,
although NSCs in the SVZ actively respond to Shh, their progeny, upon
entry into the callosum, do not. These results are consistent with the
minimal expression of Gli proteins by OPCs or oligodendrocytes during
in the CC of Gli1CE/+ mice (bottom) and increase in the CC of NestinCE/+
mice. N = 5 mice per group/genotype. Scale bars, 100 µm; inset, 40 µm.
c, d, Percentage of GFP+ oligodendrocyte progenitors (PDGFR-α+), mature
oligodendrocytes (CC1+), and astrocytes (GFAP+) in the CC of Gli1CE/+
(c) and NestinCE/+ (d) mice. N = 5 mice per group/genotype. e, Ten weeks after
cessation of cuprizone, GFP-labelled processes flank nodes of Ranvier,
demarcated by β4-spectrin (top). N = 3 mice. Scale bar, 10 µm.
Immunoelectron microscopy shows membrane-targeted GFP (mGFP),
indicated by immunogold particles, present within compact myelin sheaths
that surround axons (bottom). N = 3 mice. Scale bar, 100 nm. Data are
mean ± s.e.m.


development (http://web.stanford.edu/group/barres_lab/brain_rnaseq.html)
and indicate Shh signalling is decreased in OPCs during both
normal development and active remyelination.

A time-course analysis of fate-mapped cells in Gli1CE/+ mice
strongly suggests NSCs emigrate from the SVZ into the CC after
demyelination (Extended Data Fig. 2d-f) and argues against
transdifferentiation of Gli1-expressing astrocytes in the cortex as a source
of labelled cells. Together, these data indicate that a Shh-responsive
pool of NSCs in the SVZ is recruited specifically to the demyelinated
CC, where the cells downregulate Gli1 and differentiate into mature,
myelinating oligodendrocytes.

Downregulation of Gli1 expression by remyelinating SVZ-derived
OPCs raised the possibility that inhibiting Shh signalling might further
augment remyelination, a notion that contrasts with previous studies
showing that Shh signalling can promote repair after CNS injury and is
required for remyelination. We first analysed the effects of a partial loss
of Shh signalling by fate mapping NSCs in Gli1 null (Gli1CE/nLacZ;RCE)
versus heterozygous (Gli1CE/+;RCE) mice. Consistent with our
hypothesis, there were many more GFP-labelled cells in the remyelinating
CC of Gli1 nulls (63 ± 6.5 per section) than in Gli1 heterozygotes
(17.3 ± 2.6 per section, Fig. 2a, b), and a much higher percentage of
these were mature oligodendrocytes in the nulls (81.3 ± 4.4%) than in
the heterozygotes (40.2 ± 15.1%, Fig. 2c). Overall, there were ~7.5-fold
more GFP-labelled, mature oligodendrocytes in the Gli1 null versus
heterozygous mice. The proportion of labelled OPCs in the nulls versus
heterozygotes was similar (5.22 ± 2.5% versus 9.8 ± 8.7%, Fig. 2c)
whereas that of GFAP-expressing astrocytes was significantly reduced
(2.6 ± 1.3% versus 15.5 ± 4.1%, Fig. 2c). Finally, overall myelin levels in
the CC were increased in the Gli1 nulls compared with heterozygotes
3 weeks after cessation of cuprizone (Fig. 2d), strongly suggesting this
enhanced NSC remyelination is physiologically significant.
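
The fold difference in labelled mature oligodendrocytes can be approximated from the group means quoted in this paragraph by multiplying the labelled cells per section by the fraction that are mature oligodendrocytes. The sketch below is illustrative only: it uses the rounded group means from the text rather than per-animal data, and the function and variable names are hypothetical. Working from rounded means, it gives a value close to, but not exactly, the reported ~7.5-fold.

```python
def mature_ol_per_section(cells_per_section, percent_mature):
    """Labelled mature oligodendrocytes per section = cells x fraction mature."""
    return cells_per_section * percent_mature / 100.0


# Group means quoted in the text (Fig. 2b, c); per-animal values are not used.
null_ol = mature_ol_per_section(63.0, 81.3)   # Gli1 null
het_ol = mature_ol_per_section(17.3, 40.2)    # Gli1 heterozygote

fold_change = null_ol / het_ol
print(f"null: {null_ol:.1f}/section, het: {het_ol:.1f}/section, "
      f"fold: {fold_change:.1f}")
```

The same arithmetic applies to any pair of groups for which both a cell count and a lineage percentage are reported.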
Figure 2 | Loss of Gli1 enhances oligodendrogliogenesis during
remyelination. a, Brains of Gli1CE/+ (Gli1Het), Gli1CE/nLacZ (Gli1Null) and
Gli1CE/+;Smofl/fl (SmoNull) mice were analysed 2 weeks after cessation of
cuprizone by immunofluorescence. GFP+ cells are only observed in the
CC of mice receiving cuprizone. N = 10 mice per group/genotype. Scale bar,
50 µm. Het, heterozygous. b, Quantification of the numbers of GFP+ cells in the
CC shows a significant increase in Gli1Null mice compared with Gli1Het and
SmoNull mice. N = 3 mice per group/genotype. c, Gli1Null mice have a greater
proportion of labelled mature oligodendrocytes and reduced proportions of
astrocytes than do Gli1Het and SmoNull mice. N = 3 mice per group/genotype.
d, Three weeks after cessation of cuprizone, Gli1Null mice have enhanced MBP
expression in the CC. N = 4 mice per group/genotype. Scale bar, 50 µm.
Data are mean ± s.e.m., Student's t-test.


While cuprizone primarily demyelinates the CC, it also has modest
effects on other white matter tracts. Accordingly, labelled
oligodendrocytes in the Gli1 nulls were also present in white matter tracts at
sites distant from the SVZ including the lateral striatum, anterior
commissure (Extended Data Fig. 3a, b) and the optic nerve (data not
shown). No GFP-labelled cells were seen in the CC or other white
matter tracts of heterozygotes or nulls on a control diet (Fig. 2a).
Thus, in the adult, this effect of Gli1 is specific to remyelination.

Myelination also started significantly earlier in the Gli1 nulls than in
the heterozygotes (Extended Data Fig. 4a, b). In addition, the CC were
slightly larger on average in the adult Gli1 nulls versus heterozygotes
on the basis of Black-Gold myelin staining (Extended Data Fig. 4c-e),
although differences were not statistically significant. Thus, during
development, Gli1 expression delays the onset of myelination and in
the adult substantially inhibits remyelination by the Shh-responsive
NSCs.

As Gli1 is expressed only upon sustained, high-level Shh signalling,
we asked whether complete abrogation of this pathway also enhanced
myelination. In contrast to loss of Gli1, loss of canonical Shh signalling
by conditional ablation of smoothened in Gli1-expressing NSCs
(Gli1CE/+;Smofl/fl;RCE mice) did not increase the numbers of labelled
cells in the CC or alter their cell fates (Fig. 2a-c). Thus, loss of Gli1 is
distinct from loss of Shh signalling, emphasizing the specificity of the
effects of Gli1.

We also examined the effects of activating canonical Shh signalling
in the presence or absence of Gli1. To this end, we expressed an
activated (M2) form of Smo in NSCs upon tamoxifen treatment in
Gli1 heterozygous (Gli1CE/+;SmoM2) and null (Gli1CE/nLacZ;SmoM2)
mice; the activated SmoM2 allele is itself fused to yellow fluorescent
protein, allowing fate mapping. Again, labelled cells were only detected
in the CC in mice that had undergone demyelination (Extended
Data Fig. 5a). In the Gli1CE/+;SmoM2 mice, many of the GFP+ cells
Figure 3 | Pharmacological inhibition of Gli1 promotes NSC recruitment
and differentiation during remyelination. a, Gli1+ cells were fate-mapped in
cuprizone-fed Gli1CE/+ (Gli1Het) mice treated with vehicle or GANT61 for
4 weeks before analysis. Mice that received GANT61 had more GFP+ cells than
those receiving vehicle. N = 5 mice per group. Scale bar, 50 µm. b, c, Mice
that received GANT61 had an approximately sevenfold increase in the
numbers of GFP-labelled cells in the CC (b) and a significant increase in the
percentage of mature oligodendrocytes (c). d, Wild-type mice were fed a
cuprizone diet and treated with vehicle or GANT61 for 9 weeks. The brains
were examined 6 weeks after recovery from cuprizone diet. Mice that received
GANT61 showed significantly higher MBP fluorescence intensity levels
than vehicle-treated mice. N = 5 mice per group. Data are mean ± s.e.m.,
Student's t-test.


(5.6 ± 2.9 per section) in the CC (Extended Data Fig. 5b) were
oligodendrocyte progenitors (57.5 ± 23.2%, Extended Data Fig. 5c),
a much higher proportion than in Gli1CE/+ mice (9.8 ± 8.7%). These
results agree with previous studies that showed that increasing the
levels of Shh in the brain enhances the generation of OPCs but blocks
their maturation. The Gli1CE/nLacZ;SmoM2 mice had many more
GFP-labelled cells in the CC than the Gli1CE/+;SmoM2 mice (47.2 ± 25.7
versus 5.6 ± 2.9 per section, Extended Data Fig. 5b) and a significantly
greater proportion of these were mature oligodendrocytes (52.5 ± 10.5%
versus 28.8 ± 3.8%). On average, the Gli1CE/nLacZ;SmoM2 mice had ~16
times as many oligodendrocytes derived from the Shh-responsive NSC pool
as the Gli1CE/+;SmoM2 mice (Extended Data Fig. 5b, c). These results
indicate loss of Gli1 has an even greater effect in the context of active
Shh signalling, promoting robust recruitment and relieving an arrest of
NSC differentiation into oligodendrocytes in the remyelinating CC.

Other analyses revealed that loss of Gli1 results in a significant
increase in proliferation in Shh-responsive SVZ NSCs but only at
the onset of demyelination, namely 3 weeks of cuprizone treatment
(Extended Data Fig. 6a, b). There was no difference in the proliferation
of NSCs of Gli1 nulls versus heterozygotes on a normal diet, on a
cuprizone diet at 4, 5 or 6 weeks or 2 weeks after removal of cuprizone
(data not shown). The higher proliferation at 3 weeks may contribute
to preservation of the stem-cell pool, which was unchanged in the SVZ
of Gli1 nulls (see Extended Data Fig. 6c). We did not detect any
increase in Shh levels with demyelination or a significant difference
in Shh levels between Gli1 heterozygotes versus nulls (Extended Data
Fig. 6d-f). Thus, the major effects of loss of Gli1 in NSCs appear to be
enhanced differentiation and recruitment to lesion sites and an
increase in proliferation during demyelination.

These findings suggested that Gli1 might be a useful therapeutic
target to promote remyelination. To test this possibility, we infused
GANT61, a small-molecule inhibitor of Gli1, into the lateral ventricle
of Gli1CE/+ mice via a mini-osmotic pump; we corroborated inhibition
of Gli1 by the reduction of its messenger RNA (mRNA) levels by
quantitative PCR (qPCR) (Extended Data Fig. 7a). A similar inhibition
of Gli1 mRNA levels was observed when the drug was administered by
intraperitoneal injection and oral gavage, indicating GANT61 can
cross the blood-brain barrier efficiently. We infused GANT61 in
Gli1CE/+ mice during the last 2 weeks of the cuprizone diet and
continued it for an additional 2 weeks off cuprizone for a total of 4 weeks.
Mice that received GANT61 versus vehicle had significantly greater
numbers of labelled cells (43.8 ± 22.1 versus 6.4 ± 5 per section,


©2015 Macmillan Publishers Limited. All rights reserved 


Fig. 3a, b) and a significantly greater proportion of these were
oligodendrocytes (65 ± 7.5% versus 21 ± 18.7%, Fig. 3c).
Administration of GANT61 was well tolerated and did not deplete
the NSCs in the SVZ (Extended Data Fig. 7b, c). Importantly, mice
receiving extended treatment with GANT61, namely during the last
3 weeks of the cuprizone diet and 6 weeks thereafter for a total of
9 weeks, had more myelin in the CC than mice similarly treated with
vehicle (Fig. 3d). Thus, the Gli1 inhibitor enhanced the recruitment
and differentiation of Shh-responsive NSCs into oligodendrocytes at
sites of demyelination, promoting remyelination.

These effects of GANT61 are specific to NSCs as there were no
effects on remyelination by OPCs on the basis of their fate mapping
in NG2CreERTM transgenic mice crossed to the RCE reporter
(Extended Data Fig. 8). The lack of an effect of GANT61 on OPC
remyelination is consistent with the absence of Gli1 expression by
these cells. These results also indicate that enhanced repair by NSCs
resulting from GANT61 treatment does not come at the expense of
OPC remyelination: rather, it is additive.

To address the therapeutic potential of inhibiting Gli1, we examined
the effects of GANT61 in a relapsing-remitting model of experimental
autoimmune encephalomyelitis (RR-EAE), a physiologically relevant model
of inflammatory demyelination and remyelination. In this model,
RR-EAE is induced by injecting proteolipid protein (PLP) peptide into
wild-type SJL mice; the severity of the clinical phenotype of the initial
attack correlates with the extent of spinal cord inflammation, whereas
the late-stage neurological disability in later relapses correlates with
axonal loss, probably resulting from cumulative injury including
chronic demyelination. GANT61 was administered by daily oral
gavage either prophylactically (that is, at the onset of PLP
immunization) or therapeutically (that is, at the onset of symptoms). Neither
treatment protocol altered the induction of EAE or the severity of the
acute attack (about day 12), suggesting GANT61 did not affect the
immune response (Fig. 4a). In agreement, Gli1 was not expressed by
cells in the lymphocytic or monocytic lineages isolated from the spleen,
thymus or liver of healthy (Extended Data Fig. 9) or cuprizone-treated
mice (data not shown).
Figure 4 | GANT61 improves functional outcomes and is neuroprotective
in a RR-EAE model. a, EAE clinical scores after prophylactic or therapeutic
treatment with GANT61 compared with vehicle administration. N = 9 mice
per group. Data are mean ± s.e.m., *P < 0.05, two-way analysis of variance
with Tukey's multiple comparison test. b, c, Electron micrographs from the
ventral lumbar spinal cords of control, GANT61- and vehicle-treated EAE mice
show axonal pathology (b) including axolysis (arrow) and dense axoplasm
(arrowhead). Quantification of 500 axons per group from three mice per
Of note, GANT61 reduced the severity of the first relapse (around
day 27) and significantly enhanced functional recovery during and
after the second relapse (around day 46) compared with vehicle
treatment (Fig. 4a). We therefore examined myelin levels, axon pathology
and motor neuron numbers in the (prophylactic, therapeutic)
GANT61- and vehicle-treated lumbar spinal cords at the end of the
second relapse phase (that is, day 53). In electron microscopy images,
all three groups had significant spinal cord pathology, with disruption
of fascicles of myelinated axons and pathology of individual
myelinated axons most evident in vehicle-treated compared with the
GANT61-treated groups (Fig. 4b, c). In all EAE groups, pathology
was most pronounced in small-diameter axons (<0.5 µm, Extended
Data Fig. 10a-c) which were increased in numbers and had
significantly lower G ratios (ratio of axon diameter to myelinated axon
diameter); the reduction in axon diameter and corresponding increase
in G ratios are probably due to axonal atrophy and suggestive of active
demyelination. We did not detect significant numbers of
unmyelinated or thinly myelinated axons in any group (Extended Data Fig.
10a-c). While thin myelin sheaths have long been considered the
hallmark of remyelination, recent studies suggest remyelination
performed by neural stem cells or in the spinal cord can be of normal
thickness. Analysis of MBP levels supported significantly higher
levels of myelin in the drug- versus vehicle-treated groups (Fig. 4f).
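
The G ratio defined in this paragraph (axon diameter divided by the diameter of the axon plus its myelin sheath) can be sketched as a one-line calculation. The example below is a minimal illustration, not the authors' measurement pipeline; the function name and the diameters are hypothetical values chosen only to show how the ratio behaves.

```python
def g_ratio(axon_diameter_um, fibre_diameter_um):
    """G ratio = axon diameter / myelinated fibre (axon + sheath) diameter."""
    if not 0 < axon_diameter_um <= fibre_diameter_um:
        raise ValueError("expected 0 < axon diameter <= fibre diameter")
    return axon_diameter_um / fibre_diameter_um


# Hypothetical diameters in micrometres, for illustration only.
examples = {
    "unmyelinated axon": (0.40, 0.40),
    "small myelinated axon": (0.40, 0.60),
    "larger myelinated axon": (1.00, 1.25),
}

for label, (axon, fibre) in examples.items():
    print(f"{label}: g = {g_ratio(axon, fibre):.2f}")
```

A ratio of 1 corresponds to no myelin; for a fixed axon diameter, a thicker sheath lowers the ratio.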

The numbers of lower motor neurons, identified by co-staining for
NeuN and choline acetyltransferase (ChAT), were significantly
reduced in the spinal cords of vehicle-treated (16.93 ± 1.92 per
section) versus healthy control mice (27.59 ± 5.9 per section, Fig. 4d,
e). Numbers of lower motor neurons in the GANT61-treated groups
were intermediate and closer to controls (prophylactic: 23.4 ± 4.3 per
section; therapeutic: 26.93 ± 8.4 per section), providing strong
presumptive evidence of neural protection, potentially because of reduced
axon pathology and/or remyelination. The relative preservation of
lower motor neurons is consistent with the reduction in axonal
pathology and probably accounts for improved functional outcomes. Taken
together, these results suggest inhibition of Gli1 enhances
remyelination and thereby protects neurons from degeneration in this EAE
group (c) shows a higher proportion of pathological axons in vehicle-treated
group compared with GANT61-treated EAE mice. Scale bar, 2 µm. N = 3 mice
per group. d, e, Alpha motor neurons labelled for NeuN and ChAT in the
lumbar spinal cords of control, vehicle- and GANT61-treated EAE mice (d).
Quantification shows more motor neurons in the GANT61- versus
vehicle-treated mice (e). Scale bar, 50 µm. N = 3 mice per group. f, Quantification for
the area of MBP expression shows lowest levels of myelin in the vehicle-treated
group. N = 3 mice per group. Data are mean ± s.e.m., Student's t-test.


15 OCTOBER 2015 | VOL 526 | NATURE | 451 




LETTER 


model without altering the immune response. Gli1 is primarily
expressed in the spinal cord by a subset of astrocytes within the
parenchymal grey matter and by cells located around the central canal, a
site of spinal cord NSCs, but not by NG2+ OPCs (Extended Data
Fig. 10d). These findings suggest the effects of inhibiting Gli1 on
myelination and neuroprotection are mediated by direct effects on
neural stem cells or potentially by indirect effects mediated by
parenchymal astrocytes.

These studies indicate that inhibiting Gli1 is a well-tolerated
and effective strategy for mobilizing and enhancing the
differentiation of a resident population of Shh-responsive neural stem
cells. The findings of elevated levels of myelin, reduced axon pathology
and preservation of lower motor neurons in a RR-EAE model further
support its therapeutic potential. This approach may therefore be
useful in aiding repair in multiple sclerosis and other demyelinating
neurological disorders. The findings also highlight that different pools
of remyelinating cells (that is, NSCs and OPCs) might require different
strategies to promote their remyelination of axons, raising the
possibility of using combinatorial strategies to enhance repair.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


No statistical methods were used to predetermine sample size. 

Fate mapping and demyelination. Ten-week-old mice were administered 5 mg
tamoxifen (Sigma) in corn oil on alternate days for a total of four intraperitoneal
injections. NestinCE/+, Gli1CE/+, Gli1CE/nLacZ, Gli1nLacZ/+, Gli1nLacZ/nLacZ and
Gli1CE/+;Smofl/fl mice, maintained on Swiss-Webster background, were fed
0.4% cuprizone (ref. 31). Gli1CE/+;SmoM2, Gli1CE/nLacZ;SmoM2, NG2CreERTM and
ShhCE/+ mice were maintained in C57BL/6 background and fed 0.2% cuprizone diet
to obtain comparable demyelination in the CC (Supplementary Table 1). No
labelling was seen in the absence of tamoxifen administration. For the LPC
model, 2 µl of 1% LPC (Calbiochem) were stereotaxically injected into the CC
at 1.5 mm anterior, 1.2 mm lateral and 2.2 mm ventral to the bregma.

Drug administration. GANT61 (Enzo Life Sciences) (5.6 mg kg−1 per day) or
vehicle (50% DMSO + 15% ethanol in PEG400) were delivered via mini-osmotic
pump (model 2004, Durect) at a rate of 0.25 µl h−1 for 4 weeks into the lateral
ventricle of Gli1CE/+ mice at 0.5 mm anterior, 0.75 mm lateral and 2.5 mm ventral
to the bregma. For intraperitoneal and oral routes, 50 mg kg−1 GANT61 dissolved
in ethanol:corn oil (1:4) was administered daily for 4 weeks.
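
As a rough illustration of how the pump rate and the daily dose relate, the sketch below estimates the drug concentration that a pump reservoir would need. It is not a description of how the pumps were actually loaded: the 25 g body weight is an assumption not given in the text, and all names are hypothetical.

```python
PUMP_RATE_UL_PER_H = 0.25        # from the text (model 2004 pump)
DOSE_MG_PER_KG_PER_DAY = 5.6     # from the text
ASSUMED_BODY_WEIGHT_KG = 0.025   # assumption: a 25 g adult mouse


def reservoir_concentration_mg_per_ml(dose_mg_per_kg_day, body_weight_kg,
                                      rate_ul_per_h):
    """Concentration needed so the pump's daily volume carries the daily dose."""
    daily_volume_ul = rate_ul_per_h * 24.0
    daily_dose_mg = dose_mg_per_kg_day * body_weight_kg
    return daily_dose_mg / daily_volume_ul * 1000.0  # mg/µl -> mg/ml


concentration = reservoir_concentration_mg_per_ml(
    DOSE_MG_PER_KG_PER_DAY, ASSUMED_BODY_WEIGHT_KG, PUMP_RATE_UL_PER_H)
print(f"daily volume: {PUMP_RATE_UL_PER_H * 24:.1f} µl, "
      f"required concentration: {concentration:.1f} mg/ml")
```

Changing the assumed body weight scales the required concentration proportionally.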

Immunostaining. For Black-Gold (Millipore) myelin staining, mice were
perfused with 4% PFA, and 20 µm coronal cryosections were stained according to the
manufacturer's protocol. For all other analyses, mice were perfused with Prefer
(Anatech) and 20 µm coronal cryosections were processed for
immunofluorescence with rabbit or chicken anti-GFP (1:1,000, Invitrogen) and one of the
following antibodies: rat anti-PDGFR-α (1:200, BD Biosciences); rabbit anti-NG2
(1:200, Millipore), anti-S100B (1:600, Dako) and anti-β4-spectrin (1:4,000, from
M. Rasband); mouse anti-CC1 (1:400, Calbiochem), anti-GFAP (1:400, Sigma),
anti-NeuN (1:200, Millipore), anti-Shh (1:500, DSHB), anti-MOG (1:50, Sigma)
and anti-MBP (1:500, Millipore); goat anti-LacZ (1:2,000, Biolegend) and
anti-ChAT (1:200, Millipore); chicken anti-P0 (1:200, Millipore); guinea pig anti-Caspr
(1:3,000, from M. Bhat) and anti-Sox10 (1:1,000, from M. Wegner). Cryosections
(30 µm) of an autopsy specimen of a healthy human brain (provided by New York
Brain Bank of Columbia University) were stained with rabbit anti-Gli1 (1:2,000,
Abcam) and mouse anti-GFAP (1:400, Sigma). Secondary antibodies were goat or
donkey anti-species conjugated with Alexa Fluor dyes (1:1,000, Molecular Probes).
Nuclei were counterstained with Hoechst 33258 (1:5,000, Invitrogen).
Fluorescent images were obtained as Z-stacks of 1 µm optical sections using a
confocal laser-scanning microscope (LSM 510, Zeiss) and processed using
Adobe Photoshop. At least ten sections per mouse were analysed and data from
three to five mice were combined to determine the average and standard deviation.
The investigators were blinded to allocation during experiments and outcome
assessment. Student's t-test was performed to calculate P values.
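
The summary statistics named here and in the figure legends (mean, standard deviation, s.e.m. and Student's t-test) can be sketched with the Python standard library. This is a minimal illustration, assuming an unpaired, equal-variance t-test; the text does not state which variant was used. The per-mouse values are invented for the example, and only the t statistic and degrees of freedom are computed (a P value would additionally need the t distribution).

```python
import math
import statistics


def mean_sd_sem(values):
    """Return (mean, sample standard deviation, standard error of the mean)."""
    sd = statistics.stdev(values)
    return statistics.mean(values), sd, sd / math.sqrt(len(values))


def student_t(group_a, group_b):
    """Unpaired, equal-variance Student's t statistic and degrees of freedom."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = ((n_a - 1) * statistics.variance(group_a)
                  + (n_b - 1) * statistics.variance(group_b)) / (n_a + n_b - 2)
    diff = statistics.mean(group_a) - statistics.mean(group_b)
    return diff / math.sqrt(pooled_var * (1 / n_a + 1 / n_b)), n_a + n_b - 2


# Hypothetical per-mouse averages (cells per section), for illustration only.
null_counts = [58.0, 71.0, 60.0]
het_counts = [15.0, 21.0, 16.0]

for label, counts in (("null", null_counts), ("het", het_counts)):
    mean, sd, sem = mean_sd_sem(counts)
    print(f"{label}: mean {mean:.1f}, s.d. {sd:.1f}, s.e.m. {sem:.1f}")

t_stat, dof = student_t(null_counts, het_counts)
print(f"t = {t_stat:.2f}, d.f. = {dof}")
```

Each value entering the test is one mouse (the average over its sections), so N is the number of mice, not the number of sections.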
Flow cytometry. Cells from the thymus, spleen and liver were stained with the
following conjugated antibodies: CD4 (RM4-5), CD8 (53-6.7), CD19 (6D5), Ly6c 
(HK1.4), CD11b (M1/70), CD11c (N418), CD45 (30-F11), PDCA-1 (927), B220 
(RA3-6B2) and CD3 (145-2C11). Antibodies were purchased from eBioscience or 
Biolegend and the staining was performed according to the manufacturer’s 
instructions. DAPI (Invitrogen) was always used to exclude dead cells from the 
analysis. Stained cells were analysed on a LSRII flow cytometer (BD) and data 
processed using FlowJo (Tree Star). The investigators were blinded to the genotype 
of the cells being sorted. 

EdU labelling. EdU (Invitrogen; 200 mg kg−1) dissolved in saline was
administered by intraperitoneal injection 1 day before harvesting the brains. EdU was
visualized using the AlexaFluor-594 Click-iT EdU Cell Proliferation Assay Kit 
(Invitrogen) according to the manufacturer’s instructions. 

EAE model. The EAE experiments were randomized. The PLP-induced EAE
model was performed and scored using the EAE induction kit (Hooke Labs) on
8-week-old female SJL mice according to the manufacturer's instructions. We used
a random number generator to assign each mouse to a group and used an inclusion
criterion of a clinical score of 2 or above for final analysis (N = 9 per group). Mice
received either vehicle (ethanol:corn oil (1:4)) or 50 mg kg−1 GANT61 via oral
gavage until the end of the experiment, namely day 53 after induction. Separate
investigators performed the clinical scoring and the drug treatments and were
blinded to each other; the histological analysis was also done blinded.

Electron microscopy. Immunoelectron microscopy was performed on brain
sections to detect mGFP in the myelin wraps around axons in the CC according to a
previous study (ref. 32). The investigators were blinded to allocation during
experiments and outcome assessment.

qPCR. mRNA was extracted from the forebrains of three mice in each group using
RNeasy kit (Qiagen) and reverse-transcribed to complementary DNA using
QuantiTect kit (Qiagen). Power SYBR Green qPCR Master Mix (Applied Biosystems)
was used to perform qPCRs in a Stratagene MX3000P thermal cycler. Primers used
were Gli1 (forward, 5'-CCC ATA GGG TCT CGG GGT CTC AAA C-3'; reverse,
5'-GGA GGA CCT GCG GCT GAC TGT GTA A-3'), Gli2 (forward, 5'-AGA
GAC AGC AGA AGC TAT GCC CAA-3'; reverse, 5'-TGG GCA GCC TCC ATT
CTG TTC ATA-3') and GAPDH (forward, 5'-GGT GTG AAC GGA TTT GGC
CGT ATT G-3'; reverse, 5'-CCG TTG AAT TTG CCG TGA GTG GAG T-3').
The 2^(−ΔΔCt) method was used to analyse the relative gene expressions and Student's
t-test was used to calculate P values.
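
The relative-expression calculation named above can be sketched as follows. This is a generic illustration of the 2^(−ΔΔCt) approach, assuming roughly 100% amplification efficiency and GAPDH as the reference gene; the Ct values and function name are hypothetical and are not data from this study.

```python
def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Fold change of a target gene by the 2^(-ddCt) method.

    dCt = Ct(target) - Ct(reference) within each sample;
    ddCt = dCt(treated) - dCt(control).
    """
    d_ct_treated = ct_target_treated - ct_ref_treated
    d_ct_control = ct_target_control - ct_ref_control
    return 2.0 ** -(d_ct_treated - d_ct_control)


# Hypothetical Ct values: the target crosses threshold two cycles later in
# treated tissue while the reference gene is unchanged.
fold = relative_expression(ct_target_treated=27.0, ct_ref_treated=18.0,
                           ct_target_control=25.0, ct_ref_control=18.0)
print(f"relative expression (treated vs control): {fold:.2f}")
```

A value below 1 indicates reduced expression in the treated group relative to control after normalization to the reference gene.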


31. Elsworth, S. & Howell, J. M. Variation in the response of mice to cuprizone. Res. Vet. 
Sci. 14, 385-387 (1973). 

32. Glausier, J. R., Khan, Z. U. & Muly, E. C. Dopamine D1 and D5 receptors are localized
to discrete populations of interneurons in primate prefrontal cortex. Cereb. Cortex
19, 1820-1834 (2009).
Extended Data Figure 1 | Gli1-expressing cells generate oligodendrocytes
after demyelination. Additional markers used to analyse fate-mapped
Shh-responsive NSCs are shown. a, Two weeks after removal from cuprizone
diet, GFP-labelled cells in the CC of Gli1CE/+ mice co-expressed the
oligodendrocyte progenitor markers PDGFR-α, NG2 and Sox10, the mature
oligodendrocyte marker CC1, and the astrocytic markers GFAP and S100β.
Scale bar, 10 µm. b, Four weeks after removal from cuprizone diet, GFP-labelled
processes co-localized with myelin proteins MBP and MOG but not with
the peripheral myelin protein P0. GFP-labelled processes also overlaid the
axonal, paranodal marker Caspr. N = 5 mice per group. Scale bar, 10 µm.
Extended Data Figure 2 | Gli1-expressing neural stem cells in the SVZ
egress and generate labelled cells in the CC. a, Expression of Gli1 in the
forebrain was confirmed in Gli1nLacZ/+ mice by immunofluorescence for LacZ.
Labelled cells were observed in the SVZ, cortex and basal forebrain but not in
the CC of mice. The right panel shows the magnified images for the
corresponding boxes in the left panel. N = 5 mice. Scale bar, 50 µm. b, Double
staining for PDGFR-α and LacZ in the Gli1nLacZ/+ forebrain does not show any
co-labelled cells, indicating that Gli1 is not expressed by OPCs. N = 5 mice.
Scale bar, 50 µm. c, The ventral SVZ lining the body of the lateral ventricle from
a human brain specimen shows co-localization of Gli1 with GFAP+ cells
(yellow arrow) as well as a Gli1+ cell not expressing GFAP (arrowhead). N = 1
brain. Scale bar, 50 µm. d-f, Time-course analysis of the SVZ and CC of
Gli1CE/+ mice stereotactically injected with saline (control, left panels) or
LPC (right panels) to induce demyelination. N = 3 mice per group. No labelled
cells were seen within the CC after saline injections; areas of ingress into the
LPC-injected CC are boxed. At 1 day post-lesion (d.p.l.) (d), GFP-labelled
cells diverted towards the CC; at 2 d.p.l. (e), a few labelled cells were seen within
the CC; at 6 d.p.l. (f), many GFP+ cells had accumulated at the site of LPC
injection (arrowhead). Scale bar, 50 µm.


Extended Data Figure 3 | Neural stem cells generate oligodendrocytes in
various white matter tracts in Gli1-null brains upon demyelination.
a, b, Fate-mapped Gli1-null cells migrated into the anterior commissure
(AC) (a) and striatum (b) after cuprizone-mediated demyelination (inset shows
the area of the forebrain) in Gli1Null (Gli1CE/nLacZ) mice. Scale bar, 50 µm.
c, Two weeks after removal from cuprizone diet, GFP-labelled cells are present
throughout the CC of Gli1Null (Gli1CE/nLacZ) mice. N = 5 mice per group.
Scale bar, 100 µm.
Extended Data Figure 4 | Myelination starts earlier in developing Gli1 null
mice. a, Gli1nLacZ/nLacZ (Gli1Null) mice (right panel) show increased MBP
levels in the forebrain at postnatal day 9 (P9) compared with Gli1nLacZ/+
(Gli1Het) mice. b, Quantification of the extent of MBP expression at P9 in the
CC of Gli1 nulls (47.12 ± 10.44%) versus heterozygotes (29.71 ± 1.77%)
corroborates that myelination is accelerated. N = 5 mice per genotype.
c, Analysis of healthy adult forebrain shows the intensity of Black-Gold myelin
stain in Gli1 heterozygotes and nulls was comparable. d, Quantification of the
sizes of the CC shows the CC in Gli1Null (Gli1nLacZ/nLacZ) was slightly larger on
average than that in Gli1Het (Gli1nLacZ/+) mice, although the difference was not
statistically significant. N = 5 mice per genotype. e, Quantification of G ratios
from electron micrographs of healthy Gli1Het (Gli1nLacZ/+) and Gli1Null
(Gli1nLacZ/nLacZ) mice revealed no difference in the thickness of myelin sheaths
in the CC. N = 3 mice per genotype. Scale bar, 50 µm. Data are mean ± s.e.m.,
Student's t-test.
Extended Data Figure 5 | Effects of gain of smoothened function in
Gli1-het versus Gli1-null cells during remyelination. Gli1Het (Gli1CE/+),
Gli1Het;SmoM2 (Gli1CE/+;SmoM2) and Gli1Null;SmoM2 (Gli1CE/nLacZ;SmoM2)
mice were injected with tamoxifen, fed either a regular or a
cuprizone-supplemented diet for 6 weeks and analysed by immunofluorescence 2 weeks
after removal of cuprizone. a, GFP+ cells are only seen in the CC of mice
on cuprizone diet (right) and not in the control mice (left). b, Quantification
of the GFP+ cells in the CC shows significantly higher numbers of cells in
Gli1Null;SmoM2 mice compared with Gli1Het and Gli1Het;SmoM2 mice.
c, Quantification of the proportion of GFP-labelled cells co-expressing lineage
markers in the CC of cuprizone-treated Gli1Het;SmoM2 and Gli1Null;SmoM2
mice shows an increase in percentage of GFP-labelled OPCs (PDGFR-α+) in
Gli1Het;SmoM2 mice and mature oligodendrocytes (CC1+) in Gli1Null;SmoM2
mice. N = 3 mice per group for each genotype. Data are mean ± s.e.m.,
Student's t-test.
Extended Data Figure 6 | Proliferation of NSCs and expression of Shh in
Gli1-null mice. a, b, At the start of demyelination (3 weeks of cuprizone diet),
Gli1^Null brains have a higher proportion of proliferating
nLacZ+ neural stem cells, indicated by the percentage of EdU-incorporating
cells co-expressing nLacZ in the SVZ, compared with Gli1^Het
brains, namely 31 ± 8.9% versus 8.22 ± 5.33%, respectively. N = 3 mice per
group for each genotype. Scale bar, 50 µm. c, The numbers of fate-mapped
Gli1 neural stem cells in the SVZ were quantified as the proportion of GFP+
cells co-expressing GFAP in Gli1^Het and Gli1^Null mice
at 2 weeks of recovery from cuprizone diet. The percentage of GFAP+GFP+
cells in the SVZ of mice receiving cuprizone diet was comparable to those on a
control diet, suggesting that the stem cell pool is not depleted during
remyelination. N = 3 mice per group for each genotype. Data are
mean ± s.e.m., Student’s t-test. d, Fate-mapping of Shh expressing cells using
an mGFP reporter labels neurons in the basal forebrain (left) with their neurites
reaching the ventral SVZ (right) in Gli1^Het and Gli1^Null brains. e,
Immunostaining of Glil\"" mice shows Shh in the SVZ and CC after 
demyelination is mostly co-localized to GFAP-expressing cells. Thus, Shh is 
produced by neurons of the basal forebrain and binds to a responsive set of 
astrocytes and neural stem cells. f, Quantification of the proportional area of the 
CC expressing Shh does not show any significant difference between Gli1^Het
and Gli1^Null mice either on control or cuprizone diet. N = 3 mice per group for
each genotype. Data are mean ± s.e.m., Student’s t-test. Scale bar, 50 µm.
Extended Data Figure 7 | GANT61 reduces Gli1 levels but does not deplete
neural stem cells in the SVZ. a, Relative expression of Gli1 and Gli2 mRNA
in the forebrain of Gli1^Het mice was examined by qPCR after administration
of GANT61 (50 mg/kg/day) for 4 weeks. GANT61 decreases the mRNA
levels of Gli1 significantly without changing Gli2 levels. N = 3 mice per group.
Data are mean ± s.e.m., Student’s t-test. b, c, The numbers of fate-mapped
Gli1+ neural stem cells in the SVZ were analysed by immunofluorescence as the
proportion of GFP+ cells co-expressing GFAP in Gli1^Het mice treated with
vehicle or GANT61 at 2 weeks of recovery from cuprizone diet (b). The
percentage of GFAP+GFP+ cells in the SVZ of mice treated with GANT61
was comparable to those treated with vehicle, suggesting that the stem cell
pool is not depleted by GANT61 (c). N = 3 mice per group. Data are
mean ± s.e.m., Student’s t-test.
Extended Data Figure 8 | Pharmacological inhibition of Gli1 does not affect
OPC recruitment or differentiation during remyelination. a, NG2^CreER
mice were treated with two doses of intraperitoneal tamoxifen to sparsely label
OPCs and analysed at 2 weeks of recovery from cuprizone. Scale bar, 50 µm.
b, Numbers (~40 GFP+ cells per field) and c, proportions of GFP-labelled
OPCs (PDGFR-α) and mature oligodendrocytes (CC1) were similar in the
GANT61- versus the vehicle-treated mice in the CC, indicating GANT61
does not alter OPC remyelination. Scale bar, 50 µm. N = 3 mice per group.
Data are mean ± s.e.m., Student’s t-test.
Extended Data Figure 9 | Gli1 is not expressed by immune cells of spleen,
thymus and liver of healthy mice. a-d, Cells from the spleen, liver and thymus
of tamoxifen-treated Gli1^Het and Gli1^Null mice were
analysed by flow cytometry for GFP expression. Wild-type mice did not express
GFP and were used as controls. Representative flow cytometry scatter plots
showing absence of GFP expression in CD45+/CD3+/CD4+/CD19+ T and B
cells (a), CD45+/CD3−/CD19−/B220+/MPDCA+ plasmacytoid dendritic
cells (b), CD45+/CD3−/CD19−/B220−/CD11b+ macrophages, monocytes,
dendritic and natural killer (NK) cells (c) and CD45+/CD3−/CD19−/B220−/
CD11b− cells (d) in Gli1^Het and Gli1^Null mice. N = 3 mice per genotype.
Extended Data Figure 10 | Effects of GANT61 on spinal cord axons in the
PLP-induced EAE model. a, Scatter plot of G ratios with respect to axonal
diameters (n = 500 axons in three mice per group, exponential trend line).
b, Analysis of electron microscopy images showing the relative proportion of
axons binned by their diameters in the four groups. c, Analysis of electron
microscopy images, indicating the G ratios of axons relative to their diameters
in the four groups (N = 500 axons in three mice per group; data are
mean ± s.e.m., Student’s t-test). d, Immunofluorescence image of a spinal
cord section from Gli1^nLacZ/+ mice shows that LacZ is not expressed by NG2+
OPCs. The inset shows expression of LacZ in the germinal zone around
the central canal. N = 3 mice. Scale bar, 50 µm.
Alternative transcription initiation leads to
expression of a novel ALK isoform in cancer 
Activation of oncogenes by mechanisms other than genetic aberrations
such as mutations, translocations, or amplifications is largely
undefined. Here we report a novel isoform of the anaplastic lymphoma
kinase (ALK) that is expressed in ~11% of melanomas and
sporadically in other human cancer types, but not in normal tissues.
The novel ALK transcript initiates from a de novo alternative
transcription initiation (ATI) site in ALK intron 19, and was
termed ALK^ATI. In ALK^ATI-expressing tumours, the ATI site is
enriched for H3K4me3 and RNA polymerase II, chromatin marks
characteristic of active transcription initiation sites^1. ALK^ATI is
expressed from both ALK alleles, and no recurrent genetic aberrations
are found at the ALK locus, indicating that the transcriptional
activation is independent of genetic aberrations at the
ALK locus. The ALK^ATI transcript encodes three proteins with
molecular weights of 61.1, 60.8 and 58.7 kilodaltons, consisting
primarily of the intracellular tyrosine kinase domain. ALK^ATI
stimulates multiple oncogenic signalling pathways, drives
growth-factor-independent cell proliferation in vitro, and promotes
tumorigenesis in vivo in mouse models. ALK inhibitors can suppress
the kinase activity of ALK^ATI, suggesting that patients with
ALK^ATI-expressing tumours may benefit from ALK inhibitors.
Our findings suggest a novel mechanism of oncogene activation
in cancer through de novo alternative transcription initiation.

To identify novel mechanisms of oncogene activation, we performed
transcriptome analyses (RNA sequencing (RNA-seq)) of
metastatic melanoma and thyroid carcinoma. We used an algorithm^2
to investigate the differential expression of exons and focused our
analysis on receptor tyrosine kinases with high expression of the kinase
domain. In two melanoma (MM-15, MM-74) and one anaplastic thyroid
carcinoma (ATC-28) samples, we identified a novel ALK transcript,
which contained the ALK exons 20-29 preceded by ~400
base pairs (bp) of intron 19, but not exons 1-19. The novel ALK
transcript was distinct from wild-type ALK, which contains all exons,
and from ALK translocations, which usually encompass exons 20-29
with little intronic expression due to preserved splice sites (Fig. 1a and
Extended Data Fig. 1a-c). We confirmed the presence of the novel
ALK transcript with a northern blot (Extended Data Fig. 2a, b).

The RNA-seq profile of the novel ALK transcript suggested an
alternative transcription initiation site in intron 19, and we termed
the novel transcript ALK^ATI. We performed 5’-rapid amplification
of cDNA ends (5’-RACE) and mapped the ATI site to a 25 bp region
in intron 19 (Fig. 1b, Extended Data Fig. 2c-e and Supplementary
Table 1). ChIP-seq and ChIP-qPCR showed that only ALK^ATI-expressing
tumours, but not controls, had significant enrichment
of histone H3K4me3 and RNA polymerase II (RNApol II) at the
ATI site, which are typical of active promoters^1 (Fig. 1c and
Extended Data Fig. 3a, b). These data suggest that ALK^ATI originates
from a newly established bona fide ATI site associated with
characteristic chromatin alterations.

To determine the prevalence of ALK^ATI expression, we screened
more than 5,000 samples from 15 different cancer types in the
TCGA RNA-seq data set. ALK^ATI was expressed in ~11% of melanoma
and sporadically in other cancer types (Extended Data Table 1).
We found no ALK^ATI expression in more than 1,600 samples from 43
different normal tissues in the Genotype-Tissue Expression (GTEx)
RNA-seq data set^3, indicating that ALK^ATI is primarily expressed in a
subset of cancer. To accurately distinguish and quantify the expression
of ALK^ATI, wild-type ALK, and translocated ALK in clinical specimens,
we developed a NanoString nCounter assay^4 with probes in
ALK exons 1-19, intron 19, and exons 20-29, and identified additional
ALK^ATI-expressing tumours derived from formalin-fixed paraffin-embedded
clinical specimens (Fig. 1d).
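
The three-probe design above lends itself to a simple decision rule: signal in exons 1-19 points to wild-type ALK, signal in intron 19 plus exons 20-29 points to the novel isoform, and signal in exons 20-29 alone points to a translocation. The following Python sketch illustrates that reasoning only. The function name, the single detection threshold of 10 normalized counts, and the assumption that each sample expresses one ALK form are all hypothetical and are not taken from the paper.

```python
def classify_alk_expression(exons_1_19: float,
                            intron_19: float,
                            exons_20_29: float,
                            detection_threshold: float = 10.0) -> str:
    """Classify an ALK expression pattern from three probe-region signals.

    Inputs are normalized counts for probes in exons 1-19, intron 19 and
    exons 20-29. The threshold is an illustrative assumption, not a value
    reported in the text.
    """
    # Every ALK form described in the text includes exons 20-29.
    if exons_20_29 < detection_threshold:
        return "no ALK expression"
    # Only full-length ALK includes exons 1-19.
    if exons_1_19 >= detection_threshold:
        return "wild-type ALK"
    # The novel isoform starts inside intron 19, so intron 19 is expressed.
    if intron_19 >= detection_threshold:
        return "ALK-ATI"
    # Exons 20-29 alone: consistent with a translocation product.
    return "translocated ALK"


if __name__ == "__main__":
    print(classify_alk_expression(0.0, 50.0, 200.0))
    print(classify_alk_expression(150.0, 0.0, 160.0))
    print(classify_alk_expression(0.0, 0.0, 300.0))
```

A real assay would need normalization against housekeeping probes and would have to handle samples that co-express more than one form, which this sketch deliberately ignores.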

To determine whether somatic genomic aberrations at the ALK
locus contribute to the establishment of the de novo ATI site, we
performed comprehensive genetic analyses including interphase fluorescence
in situ hybridization (FISH), array comparative genomic
hybridization (aCGH), whole-genome sequencing, and ultra-deep
sequencing of the ALK locus, but found no genomic aberrations that
could account for the de novo expression of ALK^ATI (Extended Data
Figs 4a-d and 5a-c, and Supplementary Tables 2-4). Reasoning that
local genomic aberrations are usually cis-acting and only alter the
expression of the affected allele^5, we analysed the single nucleotide
variants (SNVs) in the DNA-, RNA-, and ChIP-seq data. In all three
data sets, we found similar allelic SNV frequencies, indicating that
H3K4me3 decorates both ALK alleles and that both ALK alleles are
actively transcribed (Fig. 1e). These data suggest that the transcriptional
activation of ALK^ATI is independent of genomic aberrations at
the ALK locus, and that alteration of trans-acting elements, such as
transcription factors or chromatin modifiers, may contribute to
ALK^ATI expression.
Figure 1 | Alternative transcription initiation (ATI) results in a novel ALK
transcript. a, Distribution of RNA-seq reads of ALK variant transcripts:
ALK^ATI RNA-seq reads align to both ALK intron 19 and exons 20-29;
full-length, wild-type ALK (ALK^WT) RNA-seq reads align to all ALK exons, but
not to the introns; translocated ALK RNA-seq reads align only to ALK exons
20-29. b, Mapping of the ATI sites of ALK^ATI to a 25 bp region in ALK intron
19 (hg19 chr2:29,446,768-29,446,744; blue shaded area). c, ChIP-seq profile of


The ATI region contains transposable elements, including a long
terminal repeat (LTR) in ALK intron 19 and a long interspersed nuclear
element (LINE) in intron 18, both of which can regulate transcription^6
(Extended Data Fig. 6a). To evaluate whether CpG methylation of these
elements might be associated with ALK^ATI expression, we performed
bisulfite sequencing. Compared to the controls, the ALK^ATI-expressing
samples showed lower CpG methylation in regions flanking the ATI
site, including the LINE (Extended Data Fig. 6b-d). The LTR contained
only a few CpG sites with low methylation levels in all samples. As
expected, we found H3K27ac, a histone mark characteristic of active
promoters and enhancers, enriched at the ATI site of ALK^ATI-expressing
tumour samples. Surprisingly, H3K27ac was also enriched in
ALK^ATI-negative melanoma samples, but not in the control lung cancer
cell lines or the 17 non-melanoma cell lines analysed by the ENCODE
consortium^7 (Extended Data Fig. 6e, f). By integrating ChIP, DNase I
hypersensitivity, and 5’-RACE data, we defined the proximal cis-regulatory
region on chromosome 2 as chr2:29,445,000-29,447,100 and
computationally determined the potential transcription factor binding
motifs^8 (Supplementary Table 5). To test whether the LTR could function
as a promoter, we used a luciferase reporter assay and found that, in
contrast to lung cancer cell lines, melanoma cell lines showed low but
consistent luciferase activity (Extended Data Fig. 6g). These data suggest
that melanomas with H3K27ac enrichment at the ATI site might
be primed to express ALK^ATI.

The ALK^ATI transcript has three predicted in-frame start codons
(ATGs), resulting in proteins with molecular weights of 61.1, 60.8, and 
58.7 kilodaltons (kDa). All three proteins maintain the intracellular 
H3K4me3 at the ATI site. d, Quantitative mRNA profiling of different ALK
variants using Nanostring nCounter: 2 wild-type ALK-expressing
neuroblastoma cell lines (SK-N-BE2 and SK-N-DZ), 2 EML4-ALK-expressing
lung cancer cell lines (H3122 and H2228), 9 ALK^ATI-expressing tumours
(8 melanomas (MM) and 1 anaplastic thyroid carcinoma, ATC-28). e, Similar
SNV frequencies in DNA-seq, RNA-seq, and ChIP-seq (H3K4me3) data
indicate that ALK^ATI is biallelically expressed.


tyrosine kinase domain, but lack the extracellular and transmembrane
domains of wild-type ALK (Fig. 2a). Immunoblots of two neuroblastoma
cell lines (SK-N-BE2, SK-N-DZ) expressing wild-type ALK and
two lung cancer cell lines (H3122 and H2228) expressing two distinct
variants of the EML4-ALK gene fusion showed bands at the expected
sizes. ALK^ATI-expressing tumours revealed a double band at ~60 kDa,
suggesting that ALK^ATI is translated from more than one start codon
(Fig. 2b). To confirm our prediction, we mutated the three start codons
individually or in combination, and expressed them in 293T cells.
Immunoblots revealed that each mutated form of ALK^ATI no longer
produced the corresponding protein band, indicating that all three
start codons in ALK^ATI are functional and give rise to three distinct
proteins (Fig. 2c).

ALK^ATI proteins were phosphorylated in tumours with endogenous
ALK^ATI expression and in cells with exogenous ALK^ATI expression
(Fig. 2b, c), indicating that ALK^ATI is active. ALK activity was confirmed
by an in vitro kinase assay (Extended Data Fig. 7a). A kinase-dead
ALK^ATI (ALK^ATI-KD), in which a lysine in the ATP-binding site
of the kinase domain was replaced by a methionine^9, was not phosphorylated
or active. Reasoning that ALK^ATI may auto-activate by
forming homodimers similar to other receptor tyrosine kinases^10, we
tested the ability of self-interaction using co-immunoprecipitation
with V5- and HA-tagged ALK^ATI proteins. The V5-ALK^ATI readily
co-immunoprecipitated with the HA-ALK^ATI and vice versa, indicating
that ALK^ATI can self-interact, resulting in auto-phosphorylation
and kinase activity (Fig. 2d). Using immunofluorescence, we detected
ALK^ATI in both the nucleus and the cytoplasm, whereas ALK with the
Figure 2 | The ALK^ATI transcript encodes three ALK proteins mainly
containing the kinase domain. a, Illustration of ALK protein isoforms. MAM,
meprin, A-5 protein, and receptor protein-tyrosine phosphatase mu; Gly,
glycine-rich region. b, Immunoblots of total (t) and phosphorylated (p) ALK in
two ALK-expressing neuroblastoma cell lines, two EML4-ALK variant-expressing
lung cancer cell lines, three ALK^ATI-expressing tumours, and a
negative control melanoma cell line. c, Immunoblots of 293T cells with
transient expression of ALK^ATI, in which the three predicted start codons were




F1174L mutation (ALK^F1174L) and EML4-ALK were found mainly in
the cytoplasm and/or at the cell membrane (Fig. 2e). ALK immunohistochemistry
in clinical samples confirmed the nuclear and cytoplasmic
localization of ALK^ATI, suggesting that detection of nuclear ALK
expression by immunohistochemistry could be used as a simple biomarker
to identify ALK^ATI-expressing tumours (Fig. 2f and Extended
Data Fig. 7b).

To establish the functional consequences of ALK^ATI expression, we
stably transduced Ba/F3, NIH-3T3 and melan-a cells with ALK^ATI,
negative controls (ALK^ATI-KD and empty vector), and positive controls




mutated from ATG to AAG, individually or in combination as indicated. KD,
kinase-dead. d, Co-immunoprecipitation (IP) and immunoblots (IB) in 293T
cells expressing V5-tagged ALK^ATI (V5-ALK^ATI), HA-tagged ALK^ATI
(HA-ALK^ATI), or both. e, ALK immunofluorescence in NIH-3T3 cells expressing
the indicated ALK isoforms. Scale bar, 25 µm. f, Haematoxylin and eosin
(H&E) staining and ALK immunohistochemistry in ALK^ATI-expressing
human tumour samples. Scale bar, 50 µm. See Supplementary Fig. 1 for
uncropped blots from b-d.


(oncogenic ALK variants ALK^F1174L, EML4-ALK, and wild type). In
interleukin 3 (IL-3)-dependent Ba/F3 cells, expression of ALK^ATI and
the positive controls, but not the negative controls, led to IL-3-independent
proliferation (Fig. 3a). We confirmed that ALK^ATI was
expressed at similar levels in the transformed ALK^ATI-Ba/F3 cells to
those in human tumours, and that all ALK isoforms were phosphorylated
when expressed at levels required for IL-3-independent growth
(Fig. 3b). In competition assays, only Ba/F3 cells expressing green
fluorescent protein (GFP) co-expressed from the ALK expression vectors
were growing under IL-3-independent growth conditions,
Figure 3 | ALK^ATI promotes growth-factor-independent proliferation in
vitro and tumorigenesis in vivo. a, Growth curves of Ba/F3 cells stably
expressing the indicated ALK isoforms in the absence of IL-3. Error bars,
mean ± s.d.; n = 8, pooled data from 2 experiments with 4 technical replicates
each. b, ALK immunoblots of previously transformed (capable of IL-3-




independent growth) Ba/F3 cells with exogenous expression of the indicated
ALK isoforms, and of tumours with endogenous ALK^ATI expression. See
Supplementary Fig. 1 for uncropped blots. c, Tumour growth of NIH-3T3 cells
stably expressing the indicated ALK isoforms. Error bars, mean ± s.e.m.;
n = 10 tumours, see also Source Data associated with this figure.
indicating that the Ba/F3 cell transformation was driven by expression
of the ALK variants (Extended Data Fig. 7c). Consistently, ALK^ATI-expressing
NIH-3T3 and melan-a cells efficiently induced tumour
growth in mice with severe combined immunodeficiency (SCID)
(Fig. 3c and Extended Data Fig. 7d-f).

All cells expressing ALK variants (ALK^ATI, ALK^F1174L, EML4-ALK,
and wild-type ALK) were able to establish growth-factor-independent 
proliferation and tumorigenesis with similar growth rates once the 
tumours were established. The observed oncogenic capacity of wild- 
type ALK is consistent with previous reports that high endogenous 
expression or genomic amplification of ALK drives oncogenesis and 
confers sensitivity to ALK inhibitors in neuroblastomas’ '’. To 
explore the functional consequences of ALK^ATI expression further,
we stably transduced NIH-3T3 cells with either a low or high titre of
ALK^ATI, resulting in cells expressing ALK^ATI either at low or at high
levels. We found that a further increase in ALK^ATI expression levels did
not accelerate tumour graft establishment and growth, indicating that
ALK^ATI can drive tumorigenesis once a threshold of expression is
reached (Extended Data Fig. 7g-i).

To examine the therapeutic responses to pharmacological ALK 
inhibition, we treated Ba/F3 cells stably expressing various ALK 
isoforms with three different ALK inhibitors (crizotinib, ceritinib, 
and TAE-684). All three ALK inhibitors effectively inhibited IL-3- 
independent growth of the transformed Ba/F3 cells, whereas they 
had no effect on growth in the presence of IL-3 (Fig. 4a and 
Extended Data Fig. 8a, b). Crizotinib inhibited ALK^ATI phosphorylation
and downstream signalling in a concentration-dependent
manner, further corroborating that ALK^ATI is activated through
auto-phosphorylation (Fig. 4b and Extended Data Fig. 8c-e).
Crizotinib treatment induced regression of ALK^ATI-driven NIH-3T3
tumours in vivo, and immunohistochemistry of explanted
tumours confirmed reduced cell proliferation, increased apoptosis,
and inhibition of several oncogenic signalling pathways (Fig. 4c-e
and Extended Data Fig. 9a-f).
Figure 4 | ALK^ATI expression confers sensitivity to ALK inhibitors in vitro
and in vivo. a, Dose-response curves to crizotinib of Ba/F3 cells expressing the
indicated ALK isoforms in the presence or absence of IL-3. Error bars,
mean ± s.e.m.; n = 3 biological replicates. b, Representative immunoblots of
ALK^ATI-expressing Ba/F3 cells treated with increasing concentrations of
crizotinib for 2 h. See Supplementary Fig. 1 for uncropped blots. c, Normalized
tumour volume in mice implanted with NIH-3T3 cells expressing the indicated
ALK isoforms and treated with vehicle (n = 8 tumours) or crizotinib (n = 10
tumours). Error bars, mean ± s.e.m.; see also Source Data. d, H&E staining and
immunohistochemistry of explanted ALK^ATI-expressing tumours 48 h after
first crizotinib treatment. e, Normalized bioluminescence signal of ALK^ATI-expressing,
luciferase-labelled NIH-3T3 tumours treated with vehicle or
crizotinib. Error bars, mean ± s.e.m.; n = 8 tumours; see also Source Data
associated with this figure. f, Quantitative mRNA ALK profiling of a metastatic
melanoma (MM-382) compared to wild-type ALK, EML4-ALK, or ALK^ATI
using Nanostring nCounter. g, H&E staining and ALK immunohistochemistry
(inset) of MM-382. Scale bars in d and g, 50 µm. h, Computed tomography
images of a subcutaneous melanoma metastasis from patient 1 (MM-382)
before and after crizotinib treatment.
On the basis of our pre-clinical data, we identified a patient with
ALK^ATI-expressing metastatic melanoma (Fig. 4f, g). A clinical
sequencing assay’’, which evaluates 341 cancer-related genes for geno- 
mic aberrations, and FISH to assess ALK rearrangements and MET 
amplifications, revealed deletions of CDKN2A and PTEN (Extended 
Data Fig. 9g-i). The patient had previously progressed on a combina- 
tion of ipilimumab and nivolumab immunotherapy in a clinical trial, 
followed by palliative radiation and dacarbazine chemotherapy. 
Subsequent treatment with crizotinib resulted in marked symptomatic 
improvement and tumour shrinkage within 6 weeks of therapy 
(Fig. 4h). 

Taken together, we have identified a novel ALK transcript, ALK^ATI,
which arises independently of genomic aberrations at the ALK locus
through alternative transcription initiation. ALK^ATI-driven tumours
are sensitive to ALK inhibitors, suggesting that patients harbouring 
such tumours may benefit from ALK inhibitor therapy. Importantly, 
we have discovered alternative transcription initiation as a novel 
mechanism for oncogene activation. Additional oncogenes may be 
activated via similar mechanisms in other human malignancies, and 
their identification may provide new insights into oncogenesis and 
opportunities for therapeutic intervention. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


No statistical methods were used to predetermine sample size. 

Human tumour samples. The study was approved by the Institutional Review 
Boards and Ethics Committees of Memorial Sloan Kettering Cancer Center, New 
York and informed consent was obtained from all subjects (#12-245 and #00-144). 
Representative portions of excised tumours were snap-frozen in liquid nitrogen, or 
fixed in 4% neutral buffered formalin, embedded in paraffin, and processed using 
routine histological methods and stained with haematoxylin and eosin. 

RNA sequencing. Total RNA was extracted from fresh-frozen tissue sections (17 
metastatic melanomas and 6 thyroid carcinomas) using an RNeasy Mini Kit 
(Qiagen). The isolated RNA was processed using the TruSeq RNA sample Prep 
kit (Illumina) according to the manufacturer’s protocol. The libraries were 
sequenced on an Illumina HiSeq 2500 platform with 50, 75, or 100 bp paired- 
end reads to obtain on average 40,000,000-100,000,000 reads per sample. 
Sequencing data were mapped to hg19, and analysed using publicly available
software packages: SAMtools’*, Tophat’”, GATK”, Picard
(http://picard.sourceforge.net), and IGV”’.

Screening for aberrantly expressed kinases. For initial screening of RNA-seq 
data, candidate receptor tyrosine kinase (RTK) genes were defined by Gene 
Ontology annotation GO:0004714 as found in AmiGO”. DEXSeq^2 was used to
calculate exon level counts using RTK Ensembl Gene IDs. For each gene the ratio 
of reads in the first half of the gene to the second half was calculated. ALK was 
identified as the top hit. 
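
The screening step above reduces each gene to a single number, the ratio of reads in its first half to reads in its second half, so that a transcript missing its 5’ exons stands out with a very low ratio. A minimal Python sketch of that ranking follows. Splitting the gene by exon count (rather than by genomic length), the pseudocount, and the function names are assumptions made for illustration. The source states only that the two halves were compared.

```python
from typing import Dict, List, Tuple


def half_gene_ratio(exon_counts: List[float], pseudocount: float = 1.0) -> float:
    """Ratio of reads in the first half of a gene's exons to the second half.

    exon_counts holds per-exon read counts in 5' to 3' order. The gene is
    split by exon count (an assumption). A pseudocount avoids division by zero.
    """
    if len(exon_counts) < 2:
        raise ValueError("need at least two exons to compare halves")
    mid = len(exon_counts) // 2
    first_half = sum(exon_counts[:mid]) + pseudocount
    second_half = sum(exon_counts[mid:]) + pseudocount
    return first_half / second_half


def rank_candidates(counts_by_gene: Dict[str, List[float]]) -> List[Tuple[str, float]]:
    """Sort genes so the lowest first-half/second-half ratio comes first."""
    ratios = {gene: half_gene_ratio(counts) for gene, counts in counts_by_gene.items()}
    return sorted(ratios.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical counts: one gene with silent 5' exons, one evenly expressed.
    example = {
        "ALK": [0.0] * 19 + [50.0] * 10,
        "EGFR": [30.0] * 28,
    }
    for gene, ratio in rank_candidates(example):
        print(gene, round(ratio, 4))
```

A low ratio flags 3’-biased expression but cannot by itself separate a translocation from an alternative start site, which is why the text follows up with intronic read coverage and 5’-RACE.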

Analysis of public data sets. RNA-seq data were downloaded from the Broad
Institute GTEx Genotype-Tissue Expression Portal
(http://www.broadinstitute.org/gtex/). Level 3 TCGA data were downloaded from
the Broad Institute TCGA GDAC Firehose (http://gdac.broadinstitute.org/)
2013_09_23 run using exon_quantification data from
illuminahiseq_rnaseqv2__unc_edu. ALK^ATI candidates were identified as samples
with an ALK expression level of RSEM ≥ 100, ≥ 500 total reads across all ALK
exons, and ≥ 10-fold greater average expression in exons 20-29 compared to
exons 1-19. To confirm ALK^ATI expression, candidates were manually examined
in IGV”. ENCODE ChIP-seq data for H3K27ac,
mapped to hg19 and converted to bigwig track format, were downloaded from
http://genome.ucsc.edu/ENCODE/dataMatrix/encodeChipMatrixHuman.html.
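
The three candidate criteria described above can be expressed as a single predicate. The Python sketch below is illustrative only: it assumes the thresholds are inclusive lower bounds (at least 100 RSEM, at least 500 total reads, at least 10-fold), that per-exon values are supplied for ALK exons 1 to 29 in order, and that "average expression" means an unweighted mean over exons. The function and parameter names are hypothetical.

```python
from statistics import mean
from typing import Sequence


def is_alk_ati_candidate(rsem: float,
                         exon_reads: Sequence[float],
                         exon_expression: Sequence[float],
                         min_rsem: float = 100.0,
                         min_total_reads: float = 500.0,
                         min_fold: float = 10.0) -> bool:
    """Apply the three screening thresholds to one sample.

    exon_reads and exon_expression each hold 29 values, one per ALK exon,
    ordered from exon 1 to exon 29.
    """
    if len(exon_reads) != 29 or len(exon_expression) != 29:
        raise ValueError("expected one value for each of the 29 ALK exons")
    if rsem < min_rsem:
        return False
    if sum(exon_reads) < min_total_reads:
        return False
    upstream = mean(exon_expression[:19])     # exons 1-19
    downstream = mean(exon_expression[19:])   # exons 20-29
    # Multiplying avoids dividing by zero when exons 1-19 are silent.
    return downstream >= min_fold * upstream


if __name__ == "__main__":
    reads = [5.0] * 19 + [100.0] * 10
    expression = [1.0] * 19 + [20.0] * 10
    print(is_alk_ati_candidate(150.0, reads, expression))
```

Samples passing such a filter would still need the manual inspection described in the text, because the filter alone does not check for intron 19 coverage.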
Promoter/motif analysis. The proximal cis-regulatory region, chr2:29,445,000- 
29,447,100, was scanned for transcription factor motifs using FIMO^8 with default
parameters against the known vertebrate transcription factor motifs in the 
JASPAR database”’. 

5’-rapid amplification of cDNA ends (5’-RACE). We used two independent
5’-RACE techniques and three ALK^ATI-expressing tumours (MM-15, MM-74, and
ATC-28) to map the ATI site and the 5’-end of the ALK^ATI transcript. We applied
a tobacco-acid-pyrophosphatase 5’-RACE technique according to the
manufacturer’s protocol (Epicentre) using the following primers:
5’-TCATACACATACGATTTAGGTGACACTATAGAGCGGCCGCCTGCAGGAAA-3’;
5’-CAGGTCACTGATGGAGGAGGTCTTGCCAGCAAAGCA-3’. RACE products
were sequenced on an Illumina MiSeq System with a 150 bp paired-end
protocol according to the manufacturer’s instructions. The sequencing reads were
mapped to hg19 using BWA and visualized using IGV”'. We confirmed the
continuous transcription starting in ALK intron 19 with an independent
oligonucleotide-based 5’-RACE kit (Clontech) according to the manufacturer’s
protocol using the primers 5’-CTAATACGACTCACTATAGGGC-3’,
5’-ACACCTGGCCTTCATACACCTCC-3’. We cloned the RACE cDNA products into plasmids
(Invitrogen) and analysed them with Sanger sequencing. Two lung cancer cell
lines (H3122 and H2228) with EML4-ALK translocations were used as controls.
Chromatin immunoprecipitation sequencing (ChIP-seq) and ChIP-qPCR. 
Chromatin was isolated from human tumour tissue and cell lines. Fresh-frozen 
human tumour tissue (MM-15, MM-74, and ATC-28) was sectioned with a 
microtome and cross-linked in 1% paraformaldehyde for 15 min. The cross-linked 
tissue samples were quenched in 125 mM glycine, washed in PBS, re-suspended in 
lysis buffer, ground in a Tenbroeck-style tissue grinder, and sonicated. Chromatin 
isolation from cell lines and immunoprecipitation was performed as previously 
described**. Solubilized chromatin was immunoprecipitated with antibodies 
against H3K4me3, H3K27ac, and RNApol II (all Active Motif). Sequencing was 
performed on an Illumina HiSeq 2500 with 51 bp single reads. Reads were aligned 
to hg19 using Bowtie within the Illumina Analysis Pipeline. Peak calling was 
performed using MACS 1.4 comparing immunoprecipitated chromatin with 
input chromatin”. ChIP-qPCR was performed on a ViiA 7 Real Time PCR 
System (Life Technologies) using Power SYBR Master Mix (Life Technologies) with
3 technical replicates and 5 independent primer pairs, which are specified in
Supplementary Table 6. 

Ultra-deep targeted sequencing of the entire ALK locus and MSK-IMPACT. 
Targeted sequencing of the entire ALK locus was performed using custom
hybridization capture probes tiling hg19 chr2:29,400,000-30,300,000
(Roche/NimbleGen’s SeqCap EZ). This region encompassed the entire genomic footprint
of ALK and ~150 kb of upstream sequence. After the genomic DNA was
fragmented (E220, Covaris), we prepared barcoded sequence libraries (New England
Biolabs, Kapa Biosystems) and performed hybridization capture on barcoded 
pools. Using 250 ng of genomic DNA, we constructed libraries from 7 separate 
samples: 2 melanomas (MM-15 and MM-74), 1 anaplastic thyroid carcinoma 
(ATC-28), 2 lung cancer cell lines (H3122 and H2228) with EML4-ALK translo- 
cations, 1 melanoma cell line (SKMEL-28), and 1 control pool of 10 ‘normal’ blood 
samples. Libraries were pooled at equimolar concentrations (100 ng per library) 
and used in the capture reaction as previously described”. To prevent off-target 
hybridization, we added a pool of spike-in blocker oligonucleotides complement- 
ary to the full sequences of all barcoded adaptors. The captured libraries were 
sequenced on an Illumina HiSeq 2500 to generate 75 bp paired-end reads.
Sequence data were de-multiplexed using CASAVA, and aligned to hg19 using 
BWA”. Local realignment and quality score recalibration were performed using 
the Genome Analysis Toolkit (GATK) according to GATK best practices**. We 
achieved mean unique target sequence coverage of 1,778× per sample (range:
1,293-2,188×). Sequence data were analysed to identify single nucleotide
variants, small insertions/deletions (indels), and structural rearrangements. Single
nucleotide variants were called using muTect” and were compared to the negative 
control pool (pooled ‘normal’ blood samples). Variants were retained if the variant 
allele frequency in the tumour was > 5 times that in the negative control and the
frequency in the negative control was <0.02. Validated SNPs in the dbSNP 
database were filtered out. Indels were called using the SomaticIndelDetector tool 
in GATK” and were retained if the tumour harboured > 3 supporting reads and 
the frequency in the negative control was < 0.02. DELLY was used to search for 
structural rearrangements”. Memorial Sloan Kettering-Integrated Mutation 
Profiling of Actionable Cancer Targets (MSK-IMPACT) was performed as 
described previously”. 
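
The retention rules for single nucleotide variants and indels described above amount to two small predicates. The Python sketch below restates them using the comparison signs as printed (strictly greater than 5-fold and 3 reads, strictly less than 0.02). Whether the original thresholds were strict or inclusive is an assumption here, and the function names are hypothetical. This is not the muTect or GATK implementation, only the post-calling filter.

```python
def keep_snv(tumour_vaf: float,
             control_vaf: float,
             min_ratio: float = 5.0,
             max_control_vaf: float = 0.02) -> bool:
    """Retain an SNV if the tumour variant allele frequency exceeds
    min_ratio times the control frequency and the control frequency
    is below max_control_vaf."""
    return control_vaf < max_control_vaf and tumour_vaf > min_ratio * control_vaf


def keep_indel(supporting_reads: int,
               control_vaf: float,
               min_reads: int = 3,
               max_control_vaf: float = 0.02) -> bool:
    """Retain an indel if the tumour has more than min_reads supporting
    reads and the control frequency is below max_control_vaf."""
    return supporting_reads > min_reads and control_vaf < max_control_vaf


if __name__ == "__main__":
    print(keep_snv(0.30, 0.01))   # tumour VAF well above 5x control
    print(keep_snv(0.04, 0.01))   # tumour VAF below 5x control
    print(keep_indel(4, 0.0))
```

Comparing against a pooled normal control in this way suppresses recurrent sequencing artefacts, but it cannot distinguish somatic from rare germline variants when no matched normal is available.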

Bisulfite sequencing of the entire ALK locus. We performed custom capture of 
the entire ALK locus using custom hybridization capture probes tiling the entire 
genomic footprint of ALK (900 kb, chr2:29,400,000-30,300,000) followed by
bisulfite sequencing. After fragmentation (E220, Covaris) of 3 µg genomic DNA of each
sample (MM-15, ATC-28, H3122, and SKMEL-28), libraries were prepared with
the KAPA Hyper Prep Kit (Kapa Biosystems) without PCR amplification to
preserve the methylation status. Of each barcoded library, 1 µg was pooled at
equimolar concentrations and captured according to the manufacturer’s protocol
(Roche/NimbleGen’s SeqCap EZ). After washing the Dynabeads M-270 (Life 
Technologies), the non-biotinylated tumour/cell line DNA was dissociated from 
the biotinylated capture beads with 0.5 M NaOH. The single-stranded eluted DNA 
was used for bisulfite conversion using the EZ DNA Methylation-Gold Kit (Zymo 
Research) according to the manufacturer’s protocol, except for the 98 °C dena- 
turation step. After bisulfite conversion, we used the KAPA HiFi Uracil PCR 
polymerase (Kapa Biosystems) to amplify the library, purified the reaction with 
Agencourt AMPure XP beads (A63881, Beckman Coulter), and sequenced the 
library on an Illumina MiSeq with a 150 bp paired-end protocol according to the 
manufacturer’s instructions. Sequence data were aligned to hg19 and analysed 
using Bismark. We compared the methylation level at CpG sites across all samples; 
no methylation was detected in the CHG and CHH contexts. Methylation 
was first computed as the number of methylated CpG reads vs the number of total 
reads covering each CpG site (sites with < 10 reads were excluded). A sliding 
window was used to determine the mean methylation level for every 250 bp region 
(with at least three CpGs) near the ALK promoter region (chr2:29,444,000- 
29,452,000). Differential methylation was evaluated using a Mann-Whitney test. 
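As a rough sketch of the methylation summary described above: per-CpG methylation is the fraction of methylated reads, sites with fewer than 10 reads are dropped, and a mean is taken per 250 bp window containing at least three CpGs. The function names and the input layout are hypothetical, and the window step is an assumption (non-overlapping by default), because the text does not state it. The Mann-Whitney comparison is not reproduced here.

```python
def cpg_methylation(counts, min_reads=10):
    """Per-site methylation fraction.

    counts maps a CpG position to (methylated_reads, total_reads).
    Sites covered by fewer than min_reads reads are excluded.
    """
    return {pos: m / t for pos, (m, t) in counts.items() if t >= min_reads}


def window_means(levels, start, end, width=250, step=250, min_cpgs=3):
    """Mean methylation for each window [w, w + width) between start and end.

    Windows containing fewer than min_cpgs CpG sites are skipped.
    The step size is an assumption; it is not given in the text.
    """
    results = []
    w = start
    while w < end:
        values = [v for pos, v in levels.items() if w <= pos < w + width]
        if len(values) >= min_cpgs:
            results.append((w, sum(values) / len(values)))
        w += step
    return results
```

For the promoter region named in the text, the call would be `window_means(levels, 29_444_000, 29_452_000)`.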
Whole-genome sequencing. Whole-genome sequencing was performed at the 
New York Genome Center. Briefly, genomic DNA libraries were prepared from 
MM-15 and ATC-28 (no matched normal DNA was available) using the Illumina 
PCR-free kit. Libraries were sequenced on an Illumina HiSeq 2500 using the 
100 bp paired-end whole-genome sequencing protocol. Sequence reads were 
mapped using BWA and processed using GATK. Genome-wide analyses of 
mutations (HaplotypeCaller), copy number alterations (FREEC), and structural 
variations (CREST) were performed. Mutations were annotated with the 
Ensembl Variant Effect Predictor and filtered to remove common polymorphisms. 
Non-synonymous mutations along with copy number alterations and structural 
variations were visualized using Circos. 

Allelic frequencies of SNVs at the ALK locus. DNA-seq, RNA-seq, and ChIP-seq 
(H3K4me3) data were displayed in the Integrative Genomics Viewer and the 
allelic frequencies were compared for each single nucleotide variant (SNV). 
Array CGH. Genome-wide analysis of DNA copy number changes was conducted 
using an oligonucleotide SurePrint G3 Human CGH Microarray (Agilent) con- 
taining 1,000,000 probes. Slides were scanned using a microarray scanner G2505B 
(Agilent) and analysed using Genomic Workbench (Agilent). 
Interphase fluorescence in situ hybridization (FISH). ALK break-apart probes 
and locus-specific probes for MET and for centromere 7 were purchased from 
Abbott. The probes were hybridized on 5-µm-thick tissue sections and the number 
and localization of the hybridization signals were assessed in a minimum of 100 
interphase nuclei with well-delineated contours. 

Northern blots. Total RNA was extracted from fresh-frozen tissue or cell lines 
using Qiagen’s RNeasy Mini Kit (Qiagen). Up to 10 µg RNA was used for running 
formaldehyde-based northern blot analysis according to the manufacturer’s protocol 
using the RNA Ambion NorthernMax Kit (Ambion). After hybridization 
with a ³²P-labelled probe, consisting of ALK exons 20-29, the membrane was 
washed and visualized. 

NanoString. Details of the nCounter Analysis System (NanoString Technologies) 
were reported previously. In brief, two sequence-specific probes were constructed 
for each of ALK exons 1-19, intron 19, and exons 20-29. Four 
control genes (RPS13, RPL27, RPS20, and ACTB) were used for normalization. 
The probes were complementary to a 100 bp region of the target mRNA and are 
listed in Supplementary Table 7. 100 ng of total RNA from each sample was 
hybridized, the raw data were normalized to the standard curve generated via 
the nCounter system, and the average value of the two probes in each target region 
(exons 1-19, intron 19, exons 20-29) was plotted in bar charts using GraphPad 
Prism software 6.0. NanoString experiments were independently performed at 
least twice with appropriate positive and negative controls, and a representative 
experiment is shown. 

Cell lines. The NIH-3T3 mouse embryonic fibroblast cells (#CRL-1658) and the 
two neuroblastoma cell lines, SK-N-DZ (#CRL-2149) and SK-N-BE2 (#CRL- 
2271), were obtained from the American Type Culture Collection (ATCC), and 
maintained in Dulbecco’s modified eagle medium (DMEM). The lung carcinoma 
cell lines H2228 (#CRL-5935, ATCC) and H3122 (NCI, Bethesda, MD) were 
cultured in Roswell Park Memorial Institute medium (RPMI). The IL-3-dependent 
murine pro B-cell line, Ba/F3, was obtained from the ‘Deutsche Sammlung von 
Mikroorganismen und Zellkulturen’ (DSMZ) and was cultured in RPMI supplemented 
with 1 ng ml⁻¹ IL-3 (R&D). The melanoma cell lines A375, A2058, and 
Colo800 were provided by the laboratory of J. Massague and were cultured in 
DMEM. The melanoma cell lines 501mel and WW94 were provided by the labor- 
atory of L. Garraway and were cultured in DMEM. SKMEL-23, SKMEL-28, 
SKMEL-31, and SKMEL-524 are patient-derived melanoma cell lines established 
at Memorial Sloan Kettering Cancer Center and were cultured in RPMI. Melan-a 
cells were provided by D. Bennett (St. George’s Hospital, University of London, 
London, UK) and were maintained in RPMI supplemented with 200 nM 
12-O-tetradecanoylphorbol-13-acetate (TPA; Cell Signaling). For retrovirus production, 
293T cells were purchased (Clontech) and cultured in DMEM. All cell culture 
media contained 10% FBS, L-glutamine (2 mM), penicillin (100 U ml⁻¹), and 
streptomycin (100 µg ml⁻¹). All cells were cultured at 37 °C in 5% CO₂ and were 
biochemically tested negative for mycoplasma contamination every 4-8 weeks 
(Lonza). After receiving the cell lines from the indicated sources, the cell lines were 
not further authenticated, but showed the expected genomic aberrations, such as 
EML4-ALK translocations. The genomic aberrations have been validated by vari- 
ous methods, including RNA-seq or northern blot. 

Plasmids. For the ALK^ATI vector, RNA from MM-15 was reverse-transcribed with 
anchored oligo(dT) primers into cDNA (Roche), PCR-amplified with the ALK 
primers 5’-CACCATCCCATCTCCAGTCTGCTTC-3’ and 
5’-AGAGAAGTGAGTGTGCGACC-3’, and cloned into a pENTR vector (Life Technologies). The 
full-length ALK plasmid (HsCD00079531) was purchased from the DF/HCC 
DNA Resource Core (http://plasmid.med.harvard.edu) and EML4-ALKv1 was 
synthesized at GeneArt (Life Technologies). Site-directed mutagenesis was performed 
using QuikChange (Agilent): for kinase-dead ALK^ATI (ALK^ATI-KD) we 
mutated the lysine in the ATP-binding site of the kinase domain to methionine 
(p.K1150M referring to wild-type ALK) in ALK^ATI, and for ALK^F1174L a p.F1174L 
mutation was introduced into wild-type ALK. Plasmids were sub-cloned into 
pMIG-w backbones (http://www.addgene.org), resulting in MSCV-ALK^variant-IRES-GFP 
constructs, which were confirmed by digestion and sequencing. To 
confirm the start codons, we mutated the three start codons from ATG to AAG 
individually or in combination as indicated. For co-immunoprecipitation, ALK^ATI 
was subcloned into pcDNA3.1/nV5-Dest (Life Technologies) and 
MSCV-N-HA-FLAG-Dest (Addgene). For bioluminescence imaging, we used a triple modality 
retroviral reporter plasmid (red fluorescent protein (RFP)-thymidine-kinase-luciferase). 

Stable gene expression. Retrovirus was produced in 293T cells by standard 
methods using ecotropic or amphotropic packaging vectors and XtremeGene 9 
(Roche). We harvested the virus-containing supernatant 48, 64 and 72 h after 
transfection. The supernatant was pooled, filtered through a 0.45 µm PVDF membrane, 
and used for transduction in the presence of polybrene (8 µg ml⁻¹). Stable 
eGFP- or RFP-expressing cells were sorted with a FACSAria II (BD Biosciences). 
Co-immunoprecipitation. V5-ALK^ATI and HA-ALK^ATI were transiently transfected 
into 293T cells using XtremeGene 9 (Roche), and after 24 h, cells were lysed 
in 10 mM Tris-HCl (pH 7.5), 1% Triton X-100, 150 mM NaCl, 1 mM EDTA, 
1 mM DTT, 1 mM PMSF, and proteinase/phosphatase inhibitors. After incubation 
and centrifugation, 100 µl supernatant was used as input, and 300 µl for 
immunoprecipitation using the following antibodies: 2 µg of anti-V5 antibody 
(Thermo Scientific), 10 µl of EZview Red anti-HA Affinity Gel (Sigma), 2 µg of 
anti-mouse IgG (Santa Cruz). We used 20 µl of Protein A/G UltraLink Resin 
(Thermo Scientific) for immunoprecipitation. The immunoprecipitated material 
was eluted in 4× SDS loading buffer for immunoblotting. Co-immunoprecipitation 
was independently performed twice and a representative immunoblot is 
shown in Fig. 2d. 

In vitro kinase assay. Stably transduced NIH-3T3 cells were grown in a 15 cm dish, 
washed with PBS, and lysed in 20 mM Tris (pH 8.0), 1% NP-40, 125 mM NaCl, 
2.5 mM MgCl₂, and 1 mM EDTA with proteinase/phosphatase inhibitor. Lysates 
were incubated on ice, centrifuged, pre-cleared with 25 µl Protein A/G UltraLink 
Resin (Thermo Scientific) for 30 min at 4 °C under rotation, and immunoprecipitated 
with 10 µl ALK (D5F3) XP Rabbit monoclonal antibody and 25 µl Protein 
A/G UltraLink Resin. After rotation for 120 min at 4 °C, the immunoprecipitated 
material was washed and used according to the instructions of the Universal Tyrosine 
Kinase Assay Kit (Clontech). After the enzymatic reaction, the immunoprecipitated 
material was mixed with 4× SDS loading buffer for immunoblotting. In vitro 
kinase assays were performed in quadruplicate, independently repeated three 
times, and a representative experiment is shown. 

Immunohistochemistry. Immunohistochemistry was performed on archival form- 
alin-fixed paraffin-embedded tumour specimens using a standard multimer/diami- 
nobenzidine (DAB) detection protocol on a Discovery Ultra system (Roche/ 
Ventana) with appropriate negative and positive controls. The following antibodies 
(Cell Signaling Technology) were diluted in SignalStain antibody diluent (Cell 
Signaling Technology) as indicated: ALK 1:250; phospho-Akt (Ser473) 1:50; phos- 
pho-STAT3 (Tyr705) 1:400; phospho-S6 (Ser235/236) 1:400; phospho-p44/42 
MAPK (Erk1/2) (Thr202/Tyr204) 1:400; cleaved caspase-3 (Asp175) 1:400. The 
anti-Ki67 antibody was diluted 1:600 (Abcam). 

Immunofluorescence. Stably transduced NIH-3T3 cells were grown on cover- 
slips, fixed in 4% formaldehyde, washed in PBS, and incubated in blocking solution 
(5% goat serum and 0.1% Triton X-100 in PBS). After blocking, cells were incu- 
bated with an ALK monoclonal antibody (Cell Signaling Technology) diluted 
1:1,000 in blocking buffer overnight at 4°C. After washing the cells with 0.05% 
Tween 20 and PBS, cells were incubated with a secondary antibody (Life 
Technologies) diluted 1:500 in blocking buffer for 2h at room temperature. 
After washing in PBS, slides were mounted with Prolong Gold Antifade 
Reagent with DAPI (Cell Signaling Technology) and imaged with a Leica TCS 
SP5 II confocal microscope. Immunofluorescence was independently performed 
twice and a representative experiment is shown in Fig. 2e. 

Immunoblot. Cell lysates were prepared in RIPA buffer supplemented with pro- 
teinase/phosphatase inhibitor. Proteins were resolved in NuPAGE Novex 4-12% 
Bis-Tris Protein Gels (Life Technologies) and transferred electrophoretically onto 
a nitrocellulose 0.45 µm membrane (BioRad). Membranes were blocked for 1 h at 
room temperature in Odyssey Blocking Buffer (LI-COR) and were incubated 
overnight at 4°C with the primary antibodies diluted at 1:1,000 in 50% Odyssey 
Blocking Buffer in PBS plus 0.1% Tween 20. The following primary antibodies 
were used (all from Cell Signaling Technologies unless stated otherwise): anti-c- 
tubulin (Sigma-Aldrich), anti-V5 (Thermo Scientific), anti-HA3F10 (Roche), 
phospho-ALK (Tyr1604), ALK, phospho-Akt (Ser473), Akt, phospho-STAT3 
(Tyr705), STAT3, phospho-S6 (Ser235/236), S6, phospho-p44/42 MAPK (Erk1/ 
2) (Thr202/Tyr204), p44/42 MAPK (Erk1/2), phospho-MEK1/2 (Ser221), MEK1/ 
2, phospho-PRAS40 (Thr246), PRAS40 (D23C7), phospho-SHP-2 (Tyr580), 
SHP-2. After 4 washes of 5 min in PBS-T, membranes were incubated with 
secondary antibodies (IRDye 800CW goat anti-Rabbit, 1:20,000, LI-COR; 
IRDye 680RD goat anti-mouse, 1:20,000, LI-COR) in 50% Odyssey Blocking 
Buffer in PBS plus 0.1% Tween 20 for 45 min at room temperature. After 4 washes 
in PBS-T and a final wash with PBS, membranes were scanned with a LI-COR 
Odyssey CLx scanner and adjusted using LI-COR Image Studio. Immunoblots 
were independently performed at least twice and a representative experiment is 
shown in Figs 2b-d, 3b, 4b and Extended Data Figs 7a, 7d, 7f, 7h, 8c-e. 
Luciferase reporter assay. The long terminal repeat in ALK intron 19 at the ATI 
site (LTR16B2, chr2:29,446,649-29,447,062; 414 bp) was amplified using genomic 
DNA from patient MM-15 and 5’-GTCCTCATGGCTCAGCTTGT-3’ and 5’- 
AGCACTACACAGGCCACTTC-3’ primers. The PCR product 
(chr2:29,446,444-29,447,174; 731 bp) was cloned into pGL4.14-firefly luciferase vector 
(Promega). To determine the promoter activity of LTR16B2, 10° cells were trans- 
fected in triplicates with 500 ng pGL4.14-LTR16B2 or vector alone; as internal 
control, 200 ng pRL-TK-Renilla luciferase reporter vector (Promega) was 
co-transfected. Luciferase activity was measured using Dual-Glo Luciferase Assay 
System (Promega) 48 h after transfection. Promoter activity was calculated by 
normalizing firefly luciferase activity to the control Renilla luciferase activity 
and compared between pGL4.14-LTR16B2 and vector alone. The luciferase 
reporter assays were independently performed three times, the results were combined, 
and the mean ± s.d. is shown in Extended Data Fig. 6g. 
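The normalization described above can be illustrated with a small sketch. The function names are hypothetical, and expressing the reporter signal as a fold change over the mean of the vector-alone wells is an assumption about how the comparison was summarized; the text states only that firefly activity was normalized to Renilla activity and compared with vector alone.

```python
from statistics import mean, stdev


def relative_activity(firefly, renilla):
    """Normalize each well's firefly signal to its co-transfected Renilla signal."""
    return [f / r for f, r in zip(firefly, renilla)]


def fold_over_vector(reporter_ratios, vector_ratios):
    """Express reporter wells relative to the mean of the vector-alone wells.

    Returns (mean, sample s.d.) of the fold changes. Using fold change over
    empty vector is an assumption made for this illustration.
    """
    baseline = mean(vector_ratios)
    folds = [x / baseline for x in reporter_ratios]
    return mean(folds), stdev(folds)
```

Pooling the triplicate wells from three independent experiments before calling `fold_over_vector` would give the n = 9 summary reported in Extended Data Fig. 6g.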

Flow cytometry and fluorescence-activated cell sorting (FACS). Flow cytome- 
try analysis for in vitro transformation assays with Ba/F3 cells was performed on 
an LSRFortessa (BD Biosciences) at day 0 and day 14 after IL-3 withdrawal. GFP- 
or RFP-positive cells were sorted using the FITC (blue laser) or PE (yellow laser) 
channel, respectively, on a FACSAria II configured with 5 lasers (BD Biosciences). 
In vitro transformation and drug treatment assays. Stably transduced Ba/F3 
cells were cultured in RPMI medium supplemented with IL-3 (1 ng ml⁻¹). For the 
cell proliferation assay, Ba/F3 cells were transferred into IL-3 depleted RPMI 
medium, and cell growth was quantified in quadruplicates every 2-4 days by a 
luminescence assay (Promega). For cell viability assays and ALK inhibitor-dose- 
response curves, 2,000 Ba/F3 cells were plated in triplicates in 96-well plates with 
increasing concentrations of the ALK inhibitors crizotinib (LC laboratories), TAE- 
684 (ChemieTek), or ceritinib (ChemieTek) as indicated. All drugs were re-sus- 
pended in DMSO. The cell viability was assessed after 72 h by a luminescence assay 
(Promega). Results were normalized to cell growth in medium containing an 
equivalent concentration of DMSO. The inhibition curve was determined with 
GraphPad Prism 6.0 software using the ‘log(inhibitor) vs response - variable slope’ 
nonlinear regression model. For immunoblots, 10 million Ba/F3 cells were har- 
vested after 2h treatment with crizotinib, washed in ice-cold PBS, and lysed in 
RIPA buffer. All assays were independently performed at least twice and a rep- 
resentative experiment is shown. 
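For orientation, the ‘log(inhibitor) vs response - variable slope’ model is a four-parameter logistic curve. The sketch below shows the model equation and the DMSO normalization only; it does not perform the nonlinear fit, and the function and parameter names are chosen for this illustration rather than taken from the study.

```python
def normalise_to_dmso(signals, dmso_signal):
    """Express luminescence readings as a percentage of the DMSO control."""
    return [100.0 * s / dmso_signal for s in signals]


def variable_slope_response(log_conc, bottom, top, log_ic50, hill_slope):
    """Four-parameter logistic model:

        Y = bottom + (top - bottom) / (1 + 10 ** ((log_ic50 - X) * hill_slope))

    X is log10 of the inhibitor concentration. For an inhibitor the
    Hill slope is negative, so the response falls from top to bottom.
    """
    exponent = (log_ic50 - log_conc) * hill_slope
    return bottom + (top - bottom) / (1.0 + 10.0 ** exponent)
```

At `log_conc == log_ic50` the model returns the midpoint between `bottom` and `top`, which is how the IC50 is read from a fitted curve.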

In vivo tumorigenicity and drug treatment assays. All animal experiments were 
performed in accordance with a protocol approved by MSKCC Institutional 
Animal Care and Use Committee (#11-12-029). The size for each cohort was 
determined based on previous experience without specific statistical methods. 
We re-suspended 10° cells of stably transfected NIH-3T3 or melan-a cells in 
50 µl of 1:1 mix of PBS and Matrigel (BD Biosciences), and subcutaneously and 
bilaterally injected the cells into the flanks of 6-8-week-old female CB17-SCID 
mice (Taconic). Mice were chosen randomly and no animals were excluded. For 
tumour growth assays, 5 mice were injected with parental or stably transduced cell 
lines and 10 tumours were assessed (except for melan-a cells stably transfected with 
ALK^F1174L, in which 4 mice were injected and 8 tumours were assessed). Tumour 
sizes were measured with callipers, without blinding, every 2 to 7 days for a period 
of up to 100 days, and were calculated using the following formula: tumour 
volume = (D × d²)/2, whereby D and d refer to the long and short tumour diameter, 
respectively. For in vivo drug sensitivity studies, 9 mice were injected with 
the stably transduced NIH-3T3 cells expressing a luciferase reporter construct and 
the indicated plasmids. When the tumours reached an average size of 200-250 mm³, 
mice were randomized into a vehicle (4 mice) or treatment (5 mice) 
group. Mice were orally gavaged once a day with crizotinib (100 mg kg⁻¹ per day) 
or vehicle. We performed non-blinded measurement of 8 tumours in the vehicle 
group and 10 tumours in the crizotinib group with callipers every 2 to 3 days, and 
growth curves were visualized with Prism GraphPad 6.0. In parallel, we monitored 
tumour growth by bioluminescence imaging of anaesthetized mice by retro-orbital 
injection of D-luciferin (150 mg per kg body weight) and imaging with the IVIS 
Spectrum Xenogen machine (Caliper Life Sciences). Bioluminescence analysis was 
performed using Living Image software, version 2.50. After euthanizing the mice, 
tumours were explanted and either lysed in RIPA buffer (Cell Signaling 
Technology) or fixed overnight in 4% paraformaldehyde, washed, embedded in 
paraffin, and sectioned for haematoxylin and eosin (H&E) staining or 
immunohistochemistry. Mouse experiments were performed once. 
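The calliper-based volume estimate used in the tumour growth assays, volume = (D × d²)/2, is simple enough to state as code. The function name is hypothetical, and accepting the two diameters in either order is a convenience added for this sketch.

```python
def tumour_volume(diameter_a_mm, diameter_b_mm):
    """Estimate tumour volume in mm^3 as (D x d^2) / 2.

    D is the long diameter and d the short diameter; the two arguments
    may be given in either order.
    """
    long_d = max(diameter_a_mm, diameter_b_mm)
    short_d = min(diameter_a_mm, diameter_b_mm)
    return long_d * short_d ** 2 / 2
```

A tumour measuring 10 mm by 6 mm gives 180 mm³, a little below the 200-250 mm³ average size at which the drug sensitivity cohorts were randomized.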
Extended Data Figure 1 | Comparison of the RNA-seq profiles of various 
ALK transcripts. RNA-seq data are displayed in the Integrative Genomics 
Viewer (IGV). The grey bars/arrows indicate the sequencing reads. The blue 
lines connect sequencing reads that are aligned over the splice site of joining 
exons. a, The ALK^ATI transcript shows expression of ALK exons 20-29 and of 
~400 bp in intron 19 (blue shaded area). No expression of exons 1-19 or 
intronic areas, other than in intron 19, is observed. The detailed view 
illustrates that the sequencing reads align continuously between exon 20 and 
intron 19, indicating uninterrupted transcription. The 5’-UTR of ALK^ATI 
(intron 19) and exons 20-29 are expressed at comparable levels. b, The 
full-length wild-type ALK transcript shows expression of all ALK exons and only 
very little expression of the introns. The detailed view displays that the 
sequencing reads align sharply to the exons, but not to the intron 19 region, 
which is present in ALK^ATI (blue shaded area). c, The ALK fusion transcript of 
a non-small cell lung cancer with an EML4-ALK translocation shows 
expression of ALK exons 20-29, and little expression of exons 1-19 and all 
introns. The detailed view illustrates that the transcription starts mainly at exon 
20 due to a preserved splice site. Only a few reads are aligned to the intron 
19 region (blue shaded area). The green-labelled reads highlight chimaeric read 
pairs indicating the EML4-ALK translocation. 


©2015 Macmillan Publishers Limited. All rights reserved 




[Extended Data Figure 2, panels a-f (image content not recoverable from the text extraction): a, northern blot; b, RNA-seq Sashimi plot of the ALK 3’-UTR; c, IGV view of 5’-RACE sequencing reads at the ATI site; d, Sanger sequencing trace; e, nucleotide sequence of the ALK^ATI transcript (2,513 bp); f, amino acid sequence of ALK^ATI.] 


Extended Data Figure 2 | Identification of the ALK^ATI transcript. 
a, Northern blot of wild-type ALK-expressing neuroblastoma cell lines (SK-N-DZ 
and SK-N-BE2), EML4-ALK-expressing lung cancer cell lines (H3122, 
variant (V) 1 and H2228, variant (V) 3), ALK^ATI-expressing melanoma, one 
anaplastic thyroid carcinoma (ATC-28), and negative controls (melanoma cell 
lines). Except for the negative controls, each lane shows two bands: the lower 
B-band matches the shorter canonical (RefSeq) ALK transcript ending at 
~chr2:29,415,640; the upper A-band corresponds to a transcript with a 1.8 kb 
longer 3’-UTR ending at ~chr2:29,413,840. Two ALK^ATI-expressing 
melanomas, MM-284 and MM-74, show only weak signals because less than 
1 µg RNA was available; for all other samples 5-10 µg RNA were used. See 
Supplementary Fig. 1 for uncropped blots. b, RNA-seq data displayed in IGV. 
The Sashimi plot illustrates the shorter B and the longer A ALK transcripts by 
the sharp drop of sequencing reads in the 3’-UTR at chr2:29,415,640 for the B 
and at chr2:29,413,840 for the A transcript. c, IGV view of the 5’-RACE-cDNA 
fragments obtained by massively parallel sequencing. More than 95% of the 
sequencing reads (grey arrows) start within the main ATI site of 25 bp (hg19 
chr2:29,446,744-29,446,768). d, Sanger sequencing of the cloned 5’-RACE-cDNA 
fragments confirms the continuous transcription starting in ALK intron 
19 and extending to exons 20 and 21. e, The ALK^ATI transcript consists of 
~400 bp upstream of exon 20 and of ALK exons 20-29. The transcriptional 
initiation site was defined as the first base pair at which more than 5% of the 
transcripts were initiated (chr2:29,446,766). Other major transcription 
initiation sites are marked in red, the 5’- and 3’-UTRs in dark blue, the coding 
DNA sequence (CDS) in black, and the first and last base of each exon in light 
blue. The translation is initiated at 3 start codons (ATGs; bold and underlined): 
first ATG, hg19 chr2:29,446,360-29,446,362; second ATG, (+ 7-9); and third 
ATG (+ 61-63). f, The amino acid sequence of ALK^ATI. The translation is 
initiated at 1 of 3 start codons. The corresponding 3 methionines (bold and 
underlined) result in 3 different proteins, 61.08 kDa (552 amino acids), 
60.82 kDa (550 amino acids), and 58.71 kDa (532 amino acids). The kinase 
domain is highlighted in red. The lysine in the ATP binding domain is marked 
bold and underlined, and was mutated to methionine (referring to wild-type 
ALK: p.K1150M) in the kinase-dead ALK^ATI-KD. 
Extended Data Figure 3 | RNApol II and H3K4me3 are enriched at the ATI 
site of ALK^ATI-expressing tumour samples. a, b, ChIP-qPCR of H3K4me3 
(a) and RNApol II (b) at the ATI site demonstrating enrichment of both 
marks in the ALK^ATI-expressing human tumour samples, but not in the 
negative controls, including a lung cancer cell line with EML4-ALK 
translocation (H3122) and a melanoma cell line (SKMEL-524). Error bars show 
mean ± s.e.m.; n = 3 technical replicates. 
Extended Data Figure 4 | ALK^ATI is transcribed from a genomically intact 
ALK locus. a, Interphase FISH with ALK flanking probes demonstrates 
juxtaposed green and orange signals indicating no ALK rearrangement in MM-15. 
Scale bar, 10 µm. b, Interphase FISH in MM-74 shows 3 green/orange 
fusion signals in the majority of nuclei indicating a trisomy 2, but no ALK 
rearrangement. Scale bar, 10 µm. c, The top panel shows the genome-wide 
array CGH profile of MM-15 with numerous chromosomal gains and losses 
across the entire genome. The chromosomes are aligned along the x axis. The 
blue line illustrates the relative copy number (log2 ratio) and the blue bars 
highlight copy number gains and losses. The middle panel illustrates the 
relative copy number (blue line) of chromosome 2. Distal to the ALK locus, 
a loss on the short (p) arm of chromosome 2 is indicated. The lower panel 
illustrates the relative copy number across the ALK locus. The red and green 
squares represent the log2 ratio of individual array CGH probes (green, positive 
log2 ratio; red, negative log2 ratio). No disruption or selective gains or losses are 
found at the ALK locus. d, The genome-wide array CGH profile of MM-74 
shows numerous chromosomal gains and losses across the entire genome in 
the top panel. The middle panel displays a relative copy number gain of the 
entire chromosome 2, which is in line with the trisomy of chromosome 2 as 
indicated by FISH. The lower panel also displays trisomy of chromosome 2, 
but indicates no focal gains and losses at the ALK locus. 
Extended Data Figure 5 | Targeted sequencing and whole-genome 
sequencing reveal no recurrent genomic aberrations at the ALK locus. 
a, Ultra-deep sequencing data of the ALK locus are displayed in IGV. The 
genomic region around intron 19 reveals several single nucleotide variations 
(SNVs). However, the vast majority of SNVs at the ALK locus are also found in 
the general population as they are detected in the pool of normal DNA, which 
was used as the control (pooled normal, bottom panel). Numerous SNVs are 
also documented in the Single Nucleotide Polymorphism database (dbSNP; 
http://www.ncbi.nlm.nih.gov/SNP/). No genomic aberrations were found at 
the transcription initiation site of ALK^ATI. Supplementary Table 2 shows the 
detected SNVs and indels at the ALK locus after filtering out SNPs documented 
in the dbSNP database. None of the genomic aberrations was found in more 
than one case, indicating that the expression of ALK^ATI is probably not 
caused by alterations of the DNA nucleotide sequence. b, c, Circos plots of the 
whole-genome sequencing data of MM-15 (b) and ATC-28 (c) illustrating 
numerous SNV and structural aberrations. Supplementary Table 3 lists the 
detected single nucleotide polymorphisms, and Supplementary Table 4 the 
detected structural aberrations. No recurrent genomic aberrations were found 
at the ALK locus. 
Extended Data Figure 6 | Local chromatin context at the alternative 
transcription initiation (ATI) site. a, UCSC Genome Browser view of the ATI 
site. The RepeatMasker track shows transposable elements at the ATI region, 
including a long-terminal repeat (LTR) in intron 19 (LTR16B2) and a long 
interspersed element (LINE) in intron 18. The ENCODE tracks reveal a DNase 
I hypersensitivity cluster and H3K4me1 enrichment, but no H3K27ac 
enrichment. b, The methylation status of the ALK locus was assessed by custom 
capture of the ALK locus, followed by bisulfite treatment and next-generation 
sequencing. Bisulfite sequencing results of H3122 (top) and MM-15 (bottom) 
are displayed in the CG-bisulfite mode of IGV. The red colour denotes ‘C’ 
(cytosine) corresponding to methylated cytosine, which is preserved during the 
bisulfite reaction. The blue colour denotes ‘T’ (thymine) corresponding to 
unmethylated cytosine, which is converted to uracil in the bisulfite reaction, 
and subsequently amplified to thymine during PCR. c, Methylation level at 
CpG sites in ALK^ATI-expressing tumour samples (MM-15 and ATC-28) and 
non-ALK^ATI-expressing control cells (H3122, a lung cancer cell line with 
EML4-ALK expression and SKMEL-28, a melanoma cell line without ALK^ATI 
expression). d, Comparison of the methylation status of CpG sites adjacent to 
the ATI site in ALK^ATI-expressing tumour samples (MM-15 and ATC-28) and 
non-ALK^ATI-expressing control cells (H3122 and SKMEL-28). The regions 
flanking LTR16B2 have significantly lower CpG methylation levels in 
ALK^ATI-expressing samples than controls; red dots indicate a statistically significant 
difference (P < 0.05; Mann-Whitney test) between ALK^ATI-expressing and 
non-expressing samples. Black dots indicate no statistically significant 
difference. e, ChIP-seq profile of H3K27ac at the ALK^ATI locus. The 17 blue 
profiles were retrieved from ENCODE, the 5 red profiles are original data 
from our lab. Only the 3 melanoma samples (MM-15, SKMEL-28, and 
SKMEL-524; bottom), but not the 19 non-melanoma cell lines, show H3K27ac 
enrichment at the ATI site. f, ChIP-qPCR validation for the H3K27ac 
enrichment at the ATI site in 6 melanoma cell lines. Error bars show 
mean ± s.e.m.; n = 3 technical replicates. g, Luciferase reporter assay of 
LTR16B2 in melanoma cell lines (red) and lung cancer cell lines expressing 
EML4-ALK (green). Error bars show mean ± s.d.; n = 9 (3 biological replicates 
combined from 3 independent experiments). 
Extended Data Figure 7 | ALK^ATI is active in vitro, shows nuclear and
cytoplasmic localization by immunohistochemistry, and induces
tumorigenesis. a, In vitro kinase assay. The indicated ALK variants were stably
expressed in NIH-3T3 cells, immunoprecipitated, and assayed for tyrosine
kinase activity. After the enzymatic reaction, the immunoprecipitated material
was used for immunoblots to assess the amount of ALK protein used in the
kinase assay. Error bars, mean ± s.d.; n = 4 technical replicates. b, Melanoma
(MM-15) expressing ALK^ATI shows cytoplasmic and nuclear localization of
ALK by immunohistochemistry. Melanocytic tumour expressing a TPM3-ALK
translocation shows a cytoplasmic localization of the ALK fusion protein.
Fibroblasts, epithelial cells, and reactive lymphocytes serve as internal negative
controls. Scale bars, 100 μm. c, Flow cytometry analysis for green fluorescent
protein (GFP) co-expressed from the same ALK-expression vector. Cells were
cultured in IL-3-supplemented medium until day 0 (blue curve) and the
number of GFP-positive cells was assessed. The number of GFP-positive
ALK-expressing cells was assessed again 14 days after IL-3 withdrawal (red curve).
d, Immunoblots of explanted NIH-3T3 tumour grafts expressing the indicated
ALK isoforms. ALK^ATI was expressed at similar protein levels as in two
ALK^ATI-expressing clinical human tumour samples. e, Growth curves of
tumour grafts of melan-a cells stably expressing the indicated ALK isoforms in
cohorts of 4-5 mice each with bilateral grafts. Error bars, mean ± s.e.m.; n = 8
tumours for ALK^F1174L, n = 10 tumours for all other experimental groups; see
also Source Data associated with this figure. f, Immunoblots of explanted
melan-a tumour grafts expressing the indicated ALK variants compared to
ALK^ATI-expressing human tumour samples. g, Flow cytometry analysis of the
GFP signal in NIH-3T3 cells stably expressing low (ALK^ATI-low) or high levels
of ALK^ATI (ALK^ATI-high) before grafting into SCID mice. h, Immunoblot of
t-ALK in ALK^ATI-low and ALK^ATI-high cells, confirming differential
expression of ALK^ATI. See Supplementary Fig. 1 for uncropped blots for
a, d, f and h. i, Growth curves of tumour grafts of ALK^ATI-low and ALK^ATI-high
cells. Error bars, mean ± s.e.m.; n = 10 tumours; see also Source Data
associated with this figure.


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Extended Data Figure 8 image. Panels a, b: cell viability plotted against ceritinib and TAE-684 concentration (nM), with and without IL-3. Panels c–e: immunoblots of Ba/F3 cells treated with crizotinib (0, 10, 30, 100, 300, 1,000 and 3,000 nM), probed for phosphorylated (p-) and total (t-) ALK, ERK, MEK, AKT, PRAS40, STAT3 and SHP2, and for phosphorylated S6.]

Extended Data Figure 8 | Concentration-dependent ALK inhibition in
ALK^ATI-, wild-type ALK-, ALK^F1174L-, and EML4-ALK-expressing Ba/F3
cells. a, b, Cell viability assay of Ba/F3 cells, either in the presence or absence of
IL-3 (1 ng ml⁻¹), expressing the indicated ALK isoforms and treated with the
indicated doses of ALK inhibitors ceritinib (a) and TAE-684 (b). Cell viability
was measured after 72 h of drug treatment. Error bars, mean ± s.e.m.; n = 3
biological replicates. c–e, Representative immunoblots of Ba/F3 cells stably
expressing wild-type ALK (c), ALK^F1174L (d), or EML4-ALK (e) and treated
with increasing concentrations of crizotinib for 2 h. See Supplementary Fig. 1 for
uncropped blots.
Extended Data Figure 9 | Expression of wild-type ALK, ALK^F1174L, and
EML4-ALK confers sensitivity to the ALK inhibitor crizotinib in vivo.
a–c, Bioluminescence of luciferase-labelled NIH-3T3 grafted tumours
expressing wild-type ALK (a), ALK^F1174L (b), or EML4-ALK (c) in SCID mice
treated with either vehicle or crizotinib. Error bars, mean ± s.e.m.; n = 8
tumours; see also Source Data associated with this figure. d–f, H&E staining
and immunohistochemistry of explanted tumours expressing wild-type ALK
(d), ALK^F1174L (e), or EML4-ALK (f) 48 h after the first crizotinib treatment.
Scale bar, 50 μm. g, MSK-IMPACT assay reveals copy number alterations and
loss of CDKN2A and PTEN in melanoma metastasis MM-382, but no
mutations. The log2 ratio was calculated across all targeted regions by
comparing the coverage in tumour versus normal. h, FISH for ALK shows no
rearrangement; the 3 juxtaposed green/orange signals indicate a trisomy 2.
Scale bar, 1 μm. i, The four FISH signals for MET and centromere 7 indicate a
tetrasomy 7, but no MET amplification. Scale bar, 1 μm.
Extended Data Table 1 | ALK^ATI-expressing tumours in the TCGA data set

Type                                            ALK^ATI   Total # of cases
Skin cutaneous melanoma (SKCM)                  38        334
Lung adenocarcinoma (LUAD)                      3         470
Lung squamous cell carcinoma (LUSC)             1         482
Kidney renal clear cell carcinoma (KIRC)        2         480
Breast invasive carcinoma (BRCA)                1         988
Thyroid carcinoma (THCA)                        0         482
Glioblastoma multiforme (GBM)                   0         153
Brain lower grade glioma (LGG)                  0         271
Bladder urothelial carcinoma (BLCA)             0         182
Prostate adenocarcinoma (PRAD)                  0         195
Uterine corpus endometrial carcinoma (UCEC)     0         118
Kidney chromophobe (KICH)                       0         66
Colorectal adenocarcinoma (COADREAD)            0         316
Ovarian carcinoma (OV)                          0         261
Head and neck squamous cell carcinoma (HNSC)    0         303

The frequency of ALK^ATI-expressing tumours in more than 5,000 tumour samples from 15 different cancer types in the TCGA RNA-seq data set.
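
The per-type frequencies implied by the table can be worked out directly from its two count columns. The short Python sketch below does only that arithmetic; the counts are transcribed from the table, while the dictionary and function names are illustrative and not part of the original analysis.

```python
# Counts transcribed from Extended Data Table 1:
# TCGA code -> (ALK^ATI-expressing cases, total cases).
TCGA_COUNTS = {
    "SKCM": (38, 334),
    "LUAD": (3, 470),
    "LUSC": (1, 482),
    "KIRC": (2, 480),
    "BRCA": (1, 988),
    "THCA": (0, 482),
    "GBM": (0, 153),
    "LGG": (0, 271),
    "BLCA": (0, 182),
    "PRAD": (0, 195),
    "UCEC": (0, 118),
    "KICH": (0, 66),
    "COADREAD": (0, 316),
    "OV": (0, 261),
    "HNSC": (0, 303),
}


def frequency_percent(positive, total):
    """Share of cases that express ALK^ATI, as a percentage."""
    return 100.0 * positive / total


def summarize(counts):
    """Return (total positives, total cases, per-type percentage)."""
    total_positive = sum(p for p, _ in counts.values())
    total_cases = sum(t for _, t in counts.values())
    per_type = {code: frequency_percent(p, t) for code, (p, t) in counts.items()}
    return total_positive, total_cases, per_type


if __name__ == "__main__":
    positives, cases, per_type = summarize(TCGA_COUNTS)
    print(f"{positives} of {cases} samples across {len(per_type)} cancer types")
    for code, pct in sorted(per_type.items(), key=lambda kv: -kv[1]):
        print(f"{code:10s} {pct:5.1f}%")
```

Sorting by frequency makes the concentration of positive cases in melanoma easy to see against the other tumour types.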
CORRECTIONS & AMENDMENTS


CORRIGENDUM 
doi:10.1038/nature14961 


Corrigendum: Carbonic 
anhydrases, EPF2 and a novel 
protease mediate CO2 control of
stomatal development 


Cawas B. Engineer, Majid Ghassemian, Jeffrey C. Anderson, 
Scott C. Peck, Honghong Hu & Julian I. Schroeder 


Nature 513, 246-250 (2014); doi:10.1038/nature13452 


In this Letter, the RNA-seq insets in Fig. 2a on top of the bar graph 
showing qPCR data from independent experiments have errors. 
RNA-seq analyses were originally conducted with BAM files generated
by the sequencing service, but we have noticed that these BAM
files differ from those generated when using publicly available
software on the GALAXY platform. The IGV viewer images originally
generated from the BAM and BAI files were erroneously
formatted, and two of these were inadvertently mis-inserted in the
two right insets in Fig. 2a. We apologize for this error. We have now
re-analysed the original raw RNA-seq data for the same experiment, 
and the Fig. 2a insets have been corrected in the online versions of 
the paper. Analyses were now conducted using the Tuxedo Suite of 
programs (TopHat and Cuffdiff) with default parameters (aligned 
to the TAIR10 annotation) on the GALAXY platform (refs 1–3), and IGV
viewer image files were generated (see Supplementary 
Information). The large BAM, BAI and TDF files used for this 
experiment and those generated by using the GALAXY platform 
are available for public access at the following link at the UCSD 
library: http://library.ucsd.edu/dc/collection/bb6929925t. 

In addition, in the abstract we have replaced the word ‘transcriptomic’
with ‘transcript’ in the sentence: “Using cell-wall proteomic
analyses and CO2-dependent transcriptomic analyses”, as this more
clearly reflects the research following our proteomic identifications, 
as accurately described in the paper. 

Supplementary Information is available in the online version of 
the paper. 


1. Blankenberg, D. et al. Galaxy: a web-based genome analysis tool for 
experimentalists. Curr. Protoc. Mol. Biol. Chapter 19, 19.10.11-19.10.21 (2010). 

2.  Giardine, B. et al. Galaxy: a platform for interactive large-scale genome analysis. 
Genome Res. 15, 1451-1455 (2005). 

3. Goecks, J., Nekrutenko, A. & Taylor, J. Galaxy: a comprehensive approach for 
supporting accessible, reproducible, and transparent computational research in 
the life sciences. Genome Biol. 11, R86 (2010). 
CORRECTIONS & AMENDMENTS


ADDENDUM 
doi:10.1038/nature14954 


Addendum: Plio-Pleistocene
climate sensitivity evaluated using
high-resolution CO2 records


M. A. Martínez-Botí, G. L. Foster, T. B. Chalk, E. J. Rohling,
P. F. Sexton, D. J. Lunt, R. D. Pancost, M. P. S. Badger 
& D. N. Schmidt 


Nature 518, 49-54 (2015); doi:10.1038/nature14145 


We have been invited to elaborate on the approach used in this Article 
to determine equilibrium climate sensitivity (ECS) for the Plio- 
Pleistocene epoch. We opted to use a regression-based method drawing 
on many determinations of climate forcing and temperature within a 
restricted geological interval, to yield the most statistically robust estim- 
ate of ECS. By its very nature, our method yields an average (but well 
determined) ECS for the time interval in question. Our method does 
not preclude the possibility that shorter-term variations to higher or 
lower values may have existed around the one-million-year mean that 
we determined, but such shorter-term variations cannot yet be robustly 
inferred, mainly owing to issues of chronological synchronization and 
uncertainties relating to our original temporal resolution. Until the 
Pliocene atmospheric carbon dioxide levels are known in fine detail, 
the only feasible approach is the one we chose. In addition, that 
approach is especially relevant when comparing time intervals of similar
length (such as the Pliocene and the late Pleistocene) with a focus
on establishing the longer-term (10^5 to 10^6 years) response of climate to
forcing. Thus, we were able to establish the one-million-year mean ECS 
robustly, but robust assessment of any potential extreme values during 
the Pliocene will require additional carbon dioxide and climate data 
(of the type we presented in the Article), as well as important chro- 
nological improvements. If any short-term ECS values above or below 
the million-year mean were to be robustly identified, they would need 
to be carefully assessed in relation to the context of the (past) climate 
state in which such extremes occurred, and there could be important 
implications for climate sensitivity projections into our warming future. 
However, significant improvements in the geological data are needed 
before such an exercise can be reliably undertaken. We thank Dana 
Royer (Wesleyan University) for bringing this issue to our attention. 
In addition, owing to a drafting error, Fig. 5g was slightly mis- 
aligned with its corresponding x-axis. Figure 1 of this Addendum 
shows the corrected panel. This correction does not affect our conclusions,
and we thank Peter Köhler for bringing it to our attention.
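
As a rough illustration of what a regression-based estimate involves, the Python sketch below fits a least-squares line through paired forcing and temperature values and scales the slope to one doubling of carbon dioxide. It is a minimal sketch under stated assumptions, not the procedure used in the Article: the forcing per doubling (3.7 W m^-2) is a commonly quoted approximate value, the example numbers are invented, and the function names are hypothetical. Because a single slope is fitted to all points, the result is by construction an average over the interval sampled, which is the property the Addendum emphasizes.

```python
# Assumed radiative forcing for one doubling of CO2, in W m^-2.
# A commonly quoted approximate value, not a number from the Addendum.
FORCING_PER_CO2_DOUBLING = 3.7


def least_squares_slope(forcing, temperature):
    """Ordinary least-squares slope of temperature (K) against forcing (W m^-2)."""
    n = len(forcing)
    if n != len(temperature) or n < 2:
        raise ValueError("need two or more paired values")
    mean_f = sum(forcing) / n
    mean_t = sum(temperature) / n
    covariance = sum((f - mean_f) * (t - mean_t)
                     for f, t in zip(forcing, temperature))
    variance = sum((f - mean_f) ** 2 for f in forcing)
    if variance == 0:
        raise ValueError("forcing values must not all be equal")
    return covariance / variance


def mean_sensitivity_per_doubling(forcing, temperature):
    """Average warming (K) per CO2 doubling implied by the fitted slope."""
    return least_squares_slope(forcing, temperature) * FORCING_PER_CO2_DOUBLING


# Hypothetical values for illustration only (not data from the Article).
example_forcing = [-2.0, -1.0, 0.0, 1.0, 2.0]       # W m^-2
example_temperature = [-1.7, -0.7, 0.1, 0.8, 1.5]   # K

if __name__ == "__main__":
    estimate = mean_sensitivity_per_doubling(example_forcing, example_temperature)
    print(f"mean sensitivity: {estimate:.2f} K per doubling")
```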
Figure 1 | This figure shows the corrected Fig. 5g of the original Article.
MICROSCOPY


Confocal 


RESCue STED 


Nuclear-pore complexes on the nuclear membrane seen using conventional and RESCue STED. 


Betzig at the Janelia Research Campus in
Ashburn, Virginia; and William Moerner of 
Stanford University in California. 


MICROSCOPE MATCH-UP 

Today, researchers have a suite of super- 
resolution technologies at their disposal, all with 
the power to observe molecular-scale details 
well below the Abbe limit. Neurophysiologist 
Silvio Rizzoli at the Göttingen Graduate School
for Neuroscience, Biophysics and Molecular
Biosciences uses a technique called stimulated
emission depletion (STED), invented by Hell
and his colleagues, to study synaptic-vesicle
function in nerve terminals. “The best resolution
that we get routinely with STED is around
30 nanometres,” he says.

STED is relatively simple for experienced 
fluorescence-microscope users. The method 
is based on the same principles as a standard 
confocal instrument, but instead of illuminat- 
ing the sample with a single light source, it 
uses two. One beam is set at a wavelength that 
excites the fluorophores — the fluorescent tags 
— that are used by researchers to localize and 
visualize proteins; the other uses a different 
wavelength that suppresses fluorescence. This 
beam is doughnut-shaped and overlaps with the 
first beam, so that only molecules in the central 
‘doughnut hole’ continue to fluoresce. 
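
How far the doughnut shrinks the fluorescing spot can be estimated with a widely used textbook approximation, in which the Abbe limit is divided by the square root of one plus the ratio of depletion intensity to the dye's saturation intensity. The Python sketch below applies that approximation; the formula and the example wavelength and numerical aperture are assumptions brought in for illustration and do not come from the researchers quoted here.

```python
import math


def abbe_limit_nm(wavelength_nm, numerical_aperture):
    """Conventional diffraction limit: wavelength / (2 * NA)."""
    return wavelength_nm / (2.0 * numerical_aperture)


def sted_resolution_nm(wavelength_nm, numerical_aperture, saturation_ratio):
    """Approximate STED resolution.

    saturation_ratio is the depletion-beam intensity divided by the
    saturation intensity of the dye (I / I_s). A ratio of 0 gives the
    ordinary confocal (Abbe) limit.
    """
    if saturation_ratio < 0:
        raise ValueError("saturation_ratio must be non-negative")
    limit = abbe_limit_nm(wavelength_nm, numerical_aperture)
    return limit / math.sqrt(1.0 + saturation_ratio)


def saturation_ratio_for(target_nm, wavelength_nm, numerical_aperture):
    """Depletion ratio I / I_s needed to reach a target resolution."""
    limit = abbe_limit_nm(wavelength_nm, numerical_aperture)
    if not 0 < target_nm <= limit:
        raise ValueError("target must be positive and no larger than the Abbe limit")
    return (limit / target_nm) ** 2 - 1.0


if __name__ == "__main__":
    # Assumed example optics: 640-nm light and a 1.4-NA objective.
    ratio = saturation_ratio_for(30.0, 640.0, 1.4)
    print(f"Abbe limit: {abbe_limit_nm(640.0, 1.4):.0f} nm")
    print(f"I / I_s needed for 30 nm: {ratio:.0f}")
```

Under these assumptions, the square-root dependence means that each further gain in resolution costs disproportionately more depletion light, which is one reason photodamage becomes a concern.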

Obtaining a STED super-resolved image is 
not very complicated. “You have to play a bit 


The antibody problem 


It is easy to forget, when performing 
immunofluorescence on a tissue sample, 

that fluorescent labels are tethered to their 
targets with a massive protein intermediary:
an antibody. This becomes problematic at
ultrahigh resolution: antibodies protrude at
distances that disrupt image precision. “How
can you resolve a 30-nanometre distance in a
sample when the error from labelling is 10–15
nanometres?” asks Helge Ewers, a cell biologist 
at the Free University of Berlin. 

Many labelling protocols exacerbate the 
problem by using two antibodies — the first 
recognizes the target, and then a labelled 
secondary antibody attaches to the first. 
According to Silvio Rizzoli at the Göttingen
Graduate School for Neuroscience, Biophysics 
and Molecular Biosciences in Germany, 
this strategy, combined with poor sample 
preparation, undermined years of STED 
experiments, revealing apparent patterns of
molecular clustering that were actually just 
cross-linked clumps of antibodies. 

To avoid using antibodies, biologists can opt 
for genetically encoded fluorescent proteins as 
a label, but these are dimmer and less robust 
than chemical dyes. As an alternative, Ewers 
and his colleagues have used dye-tagged 
‘nanobodies’ — camel-derived single-chain 
antibodies that are roughly one-tenth the 
size of conventional ones — to reliably label 
proteins fused to green fluorescent protein 
(GFP). “They’re chemically defined, they’re
small and you can produce them in bacteria — 
I think nanobodies are a tool of the future,” says
Ewers. His team has since developed more 
nanobodies for multicolour labelling, and is 
working with genome-editing strategies to 
tag endogenous proteins rather than forcing 
overexpression of cloned genes. 


15 OCTOBER 2015 
with the parameters to see something nice,
but otherwise, you look at it the same way as 
you would with a confocal,” says Jochen Sieber, 
product manager for super-resolution technolo- 
gies at Leica Microsystems in Wetzlar, Germany, 
which manufactures microscopes. 

Other super-resolution methods rely on 
the ability to switch fluorescent labels ‘on’ and
‘off’ in a controlled fashion. These ‘probe- 
based’ (also known as ‘localization-based’) 
techniques carefully tune lighting conditions 
to ensure that only a few, sparsely distributed 
individual fluorophores are visible at any 
given time. 

The best known of these methods are 
photoactivated localization microscopy 
(PALM), developed by Betzig and his col- 
leagues’, and stochastic optical reconstruction 
microscopy (STORM), devised by Xiaowei 
Zhuang’s group’ at Harvard University in Cam- 
bridge, Massachusetts. All fluorescent labels 
start in a dark state. They are then excited using 
acontrolled pulse of laser light that switches on 
a tiny fraction of the tags, followed by a second 
pulse that switches them off again. The process 
is repeated over and over to generate a series of 
partial fluorescence images that can be recon- 
structed into a whole. 

With these techniques, cell biologists can 
achieve remarkable spatial resolution in fixed 
samples, down to single-molecule imaging. 
“I trust these approaches for anything below
20-nanometre resolution,” says Rizzoli. 

But interpreting images at this scale requires 
a careful labelling strategy to avoid introducing 
artefacts — inaccurate imaging data that arise 
from sample staining or processing methods 
and distort the true structure of the specimen 
(see ‘The antibody problem’).


Another alternative is to use ‘click-chemistry’ 
approaches, such as SNAP-Tag from New 
England Biolabs in Ipswich, Massachusetts, 
or HaloTag from Promega in Madison, 
Wisconsin. These methods use a simple 
reaction that forms a permanent link between 
a short protein tag and a chemically modified
dye molecule. “They're useful, because you 
can choose a membrane-permeable chemical 
dye and do live-cell experiments,” says Rizzoli, 
“but they do have some problems with non- 
specific binding.” 

Yet this requires genetic modification, and 
so may not be amenable to human tissue 
samples. Fortunately, there are ways to get 
reasonable super-resolution images with 
minimal artefacts — for example, Rizzoli 
recommends using a fragment for the 
second antibody to retain the specificity of 
conventional antibodies with less bulk. M.E.
On the up side, adding PALM or STORM
capabilities onto an existing fluorescence 
microscope is relatively straightforward. 
Life scientists can also purchase commercial 
instruments that are designed for probe-based 
super-resolution, such as the ELYRA from Carl 
Zeiss Microscopy in Jena, Germany, and the 
N-STORM from Nikon Instruments in Mel- 
ville, New York. Amateur users should prepare 
to invest time in their experimental design (see 
‘A good way to dye’). “Whenever we start a new 
project, we spend a lot of effort on optimizing 
labelling,” Zhuang says. “I think ironing this
part out takes researchers the most time.” 


NANO YOUTUBE 

Most super-resolution imaging concentrates 
on fixed cells, but the great promise of nano- 
scopy is the ability to image dynamic processes 
in living cells. Cell biologists want to capture 
molecules and structures as they assemble, 
adhere and interact. But collecting super- 
resolution images in real time has a few trade- 
offs. In probe-based techniques, for example, 
briefer illumination times mean that fewer 
fluorophores become activated each round 
— resulting in an image with much-reduced 
detail. One way around a weak signal is to use 
stronger illumination, although pumping too 
much light into cells can create toxic com- 
pounds that jeopardize the sample's viability. 

Some researchers are finding that structured- 
illumination microscopy (SIM), a technique 
pioneered by Mats Gustafsson at Janelia Farm,
offers a good compromise for live-cell super-resolution
imaging. “In a good SIM experiment,
we’re probably down to 100–120-nanometre
lateral resolution,” says biochemist Jordan Raff 
at the University of Oxford, UK, who uses SIM 
to study the structures that help to coordinate 
cell division. 

SIM illuminates the sample with patterned 
lines of light, which generates fluorescent 
images that override Abbe’s constraints. The 
method requires little sample preparation and 
is flexible in terms of fluorophore selection. 
Like PALM and STORM, SIM is easy to build 
onto an existing instrument, or users can opt 
for commercial systems such as the DeltaVision 
OMX SR from GE Healthcare Life Sciences. 

Daniel Davis, an immunologist at the 
University of Manchester, UK, has found SIM 
useful for studying how secretory granules 
move along actin in natural killer cells engaging 
in an immune response. But Davis also points 
out that “it doesn’t quite get you down to the 
resolution you can get with other techniques”. 

STED microscopes can also be adapted for 
live-cell imaging, despite their reputation for 
causing severe damage with the intense beams 
needed for high-resolution pictures. To reduce 
bleaching, Hell and his team devised a light-limiting
technique called RESCue, which is
available using the STED microscopes made by
Abberior Instruments, a company co-founded
by Hell and based in Göttingen, Germany. “In
Stefan Hell won a shared Nobel prize for his work.


some samples, it can reduce light load down to 
4% or 5% of traditional STED.”

Even PALM and STORM systems, which 
require comparatively time-consuming col- 
lection and reconstruction of multiple images 
that are produced from individual molecular 
labels, have ramped up their speeds dramati- 
cally to capture data from live cells. Scientists 
have shrunk the time needed for image col- 
lection by using brighter fluorescent labels, 
and a new generation of detectors can harvest 
image data from larger numbers of pixels more 
rapidly than was possible with early methods. 

Optical biophysicist Joerg Bewersdorf and his
group at Yale University in New Haven, Connecticut,
found that combining such detectors
with robust image-analysis enabled them to 
record the movement of proteins on the surface 
of living cells at up to 32 frames per second — 
essentially producing super-resolution videos. 

Leica offers a probe-based approach that 
is founded on the principle of ground-state 
depletion (GSD). This method uses light to force 
all but a handful of individual fluorophores into 
an inactive dark state. Live-cell imaging, how- 
ever, is out of the question. “Building up enough 
fluorescence events to get a nice-looking image 
can take 15–20 minutes,” says Labno. This is
much too long to track a dynamic process. 


LIFE IN 3D 
Super-resolution microscopy can also satisfy 
researchers who crave 3D images. Leica’s 
STED instrument uses two depletion beams, 
one perpendicular to the other, to generate 
a 3D-super-resolved zone in the specimen. 
Alternatively, Abberior’s microscopes use 
a device called a spatial light modulator to 
achieve an equivalent effect with a single beam. 
For probe-based methods, introducing depth 
measurements is straightforward. Zhuang’s 
group found that adding a cylindrical lens to
the light path transforms STORM’s light spots 
into elliptical shapes that can be mapped in 3D. 
This approach, used in Nikon’s N-STORM, can 
yield a depth (‘axial’) resolution of 50 nanometres
without altering lateral resolution. Zhuang’s
team has achieved still further improvements in
axial resolution with an iteration of STORM that
resolves all three dimensions at 10 nanometres.
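
The idea of reading depth from the shape of an elliptical spot can be sketched with a simple calibration model. In the Python example below, the spot's width along x and along y each follow a standard defocus curve, with their sharpest points offset in opposite directions by the cylindrical lens; depth is then estimated by finding the calibration point whose pair of widths best matches the measured pair. The curve shape and every parameter value are assumptions chosen for illustration, not figures from Zhuang's group or from any commercial instrument.

```python
import math

W0_NM = 150.0            # assumed in-focus spot width
FOCUS_OFFSET_NM = 200.0  # assumed shift of the x and y focal planes (+/-)
DEPTH_NM = 400.0         # assumed depth scale of the defocus curve


def spot_widths(z_nm):
    """Calibration model: spot widths (x, y) for a label at depth z."""
    wx = W0_NM * math.sqrt(1.0 + ((z_nm - FOCUS_OFFSET_NM) / DEPTH_NM) ** 2)
    wy = W0_NM * math.sqrt(1.0 + ((z_nm + FOCUS_OFFSET_NM) / DEPTH_NM) ** 2)
    return wx, wy


def estimate_z_nm(wx, wy, z_min=-400, z_max=400, step=1):
    """Find the depth whose calibrated widths best match the measured ones."""
    best_z = None
    best_cost = math.inf
    for z in range(z_min, z_max + 1, step):
        cal_x, cal_y = spot_widths(z)
        cost = (wx - cal_x) ** 2 + (wy - cal_y) ** 2
        if cost < best_cost:
            best_z, best_cost = z, cost
    return best_z


if __name__ == "__main__":
    measured = spot_widths(120)  # pretend these widths were measured
    print("estimated depth (nm):", estimate_z_nm(*measured))
```

In this model a label at zero depth gives a round spot, and the spot stretches along one axis or the other as the label moves above or below that plane, so the pair of widths identifies both the distance and the direction.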

Imaging success depends on more than just 
the instruments: the sample itself is also an 
important consideration. Tissue specimens 
are especially hard to image because they are 
dense and tend to scatter photons, generating 
blurry images and high levels of background 
fluorescence. As a result, image quality is best 
near the sample's surface and worsens as the 
microscope probes deeper into thick samples. 
It may be possible to overcome this hurdle by 
using chemical ‘clearing’ techniques that render 
tissues transparent. 

For now, most researchers find that the 
simplest solution is to embed fixed samples in 
plastic, and then sequentially image thin slices 
shaved off the top. “We're trying to understand 
the relationship of one synapse with different 
postsynaptic partners, which requires us to 
look at thousands of synapses in tissue at high- 
resolution in parallel,” says Bernardo Sabatini, 


A good way to dye 


Microscopist Stefan Hell at the Max Planck 
Institute for Biophysical Chemistry in 
Göttingen, Germany, thinks that all super-resolution
methods boil down to one crucial
element: “The dye is essential,” he says. The 
ideal fluorophore has an extremely bright 
‘on’ state and very dark ‘off’ state, and the 
capacity to switch between the two both 
rapidly and repeatedly. 

For live-cell imaging, many researchers 
prefer to work with genetically encoded 
fluorescent proteins. Stimulated emission 
depletion (STED) and structured illumination 
microscopy (SIM) are highly compatible 
with standard fluorophores such as green 
fluorescent protein (GFP). Stochastic optical 
reconstruction microscopy (STORM) and 
photoactivation localization microscopy 
(PALM) need photoswitchable dyes; proteins 
such as Dendra2 or EosFP, which undergo 
a laser-induced colour transition, are 
popular choices. 

But fluorescent proteins generally 
compromise resolution. “They're just too 
dim,” says neurophysiologist Silvio Rizzoli
at the Göttingen Graduate School for
Neuroscience, Biophysics and Molecular 
Biosciences in Germany. In STED, “you're 
taking the laser power and genetic 
overexpression to the maximum to get 
a signal”. 

Organic dyes are a brighter alternative. 
They tend to be more durable under 
prolonged illumination. However, they must 
be linked to another molecule to achieve 
targeted labelling, and many fluorescent dyes 


a neurobiologist at Harvard Medical School. “I 
think that in the short term, this approach plus 
super-resolution will give you that data quickly.”


PICTURE PERFECT 
Even a perfectly executed super-resolution 
study generally needs some sort of computa- 
tional processing to produce a high-quality 
image. For scientists who prefer the simplicity 
of positioning a sample under a microscope 
and having the image instantly appear on a 
computer screen, STED might be best because 
it generally does not require image processing. 
Some scientists use deconvolution tools to 
sharpen images and eliminate blur, but Hell 
avoids this whenever possible. “Raw data may 
not look as fancy, but it’s honest, and you know 
what it means,” he says. “For most other techniques,
software processing is mandatory.” And
Davis says of SIM, “You're creating a mathematical
model of what the cell looks like based on the
fluorescence data. You're not literally seeing it.”
Raff notes that many of his early experiences 


cannot penetrate living cells. For this reason, 
many researchers still focus on fixed samples. 
“We'd rather go for the extreme in resolution, 
and we try to squeeze every single photon out 
so that we can localize things very accurately,”
says cell biologist Helge Ewers at the Free 
University of Berlin. A handful of high- 
performance dyes can be used with live cells, 
such as the silicon-rhodamine dyes from the 
Swiss bioimaging company SpiroChrome, 
which generate bright-red fluorescence once 
bound to cytoskeletal proteins. 

Things get tricky when one aims to 
image many targets simultaneously using 
multicoloured labelling: because each 
fluorophore responds to a distinct ‘on’ and 
‘off’ wavelength, researchers may run out 
of bandwidth to achieve specific detection 
of more than two or three tags. In principle, 
probe-based methods can accommodate 
more labels than STED, but they are also 
more finicky in terms of experimental 
conditions. “People often come to us with 
a combination they want to use, but the 
dyes have exact opposite needs in terms 
of buffers,” says Christine Labno, technical 
director of the University of Chicago’s 
Light Microscopy Core Facility in Illinois. 
Sequential-labelling strategies may offer a 
more efficient option for conducting larger- 
scale protein-mapping experiments. For 
example, a technique known as DNA-PAINT 
uses DNA tags to selectively conjugate a 
single dye to different antibodies, enabling 
stepwise labelling of ten or more protein 
targets in one super-resolution image. M.E.
with SIM entailed recognizing that pretty
pictures can be deceiving because image- 
processing algorithms can create artefacts that 
look every bit as real as the cellular structure of 
interest. “But if you have people who know what 
to look for, they can examine the image and tell 
if something is dodgy,” he says.

For PALM and STORM, image-building is 
like a game of ‘join the dots’. The higher the
density of the labels, the easier it is for the soft- 
ware to connect those dots, leading to better 
images. But high density can also cause con- 
fusion, by generating overlapping signals that 
look like single dots — so clever use of power- 
ful image-processing algorithms is essential to 
make sense of the data. 

Given that most super-resolution techniques 
can be incorporated into existing microscopes, 
many researchers will probably try their hand 
at super-resolution imaging in the near future. 
“In my view, it doesn’t make sense for a facility 
that routinely uses confocal microscopy not 
to have STED attached to it,” says Hell. “You 
can just stop the STED beam and still have a 
confocal system.” 

Those with experience in nanoscopy are 
helping to train others. Zhuang’s team at 
Harvard University, for example, offers routine 
STORM workshops. “We go from sample preparation
to analysing images with our software,”
she says. “It’s always oversubscribed.”

That said, most biologists are still best served 
by using these instruments in core facilities that 
provide access to specialists who are familiar 
with several methods. “As biologists, we're still 
far away from understanding the physics — and 
some of us never will,” says Raff. “Your best bet 
is to try multiple different techniques out on 
your sample in an environment where there are 
people around who understand it.”


Michael Eisenstein is a freelance writer based 
in Philadelphia, Pennsylvania. 
CORRECTION

The Technology Feature ‘The cell 
menagerie: human immune profiling’ 
(Nature 525, 409-411; 2015) misstated 
the location and research focus of Hedda 
Wardemann. She is at the German Cancer 
Research Center in Heidelberg and focuses 
on single-cell sequencing. 
Lady Justice stands as a symbol of a court system that often calls on witnesses to ensure a fair trial.


Courage of 
conviction 


Expert witnesses have a crucial role in bringing science into 
the legal system — but the job is not without pressure. 


BY VIRGINIA GEWIN 


attacked his estranged wife with a machete 

and shot her friend to death. In the subse- 
quent trial, his defence attorney argued that 
Waldroup had the ‘warrior’ gene — a genetic 
variant that has been linked to aggression. As 
a result, the defence argued, he was less able to 
control his behaviour than are people who do 
not have the variant. 

Although he had been charged with first- 
degree murder of the friend and attempted 
first-degree murder of his wife, Waldroup was 
convicted in 2011 of voluntary manslaugh- 
ter and attempted second-degree murder, 
and received a 32-year sentence. Had he been 
found guilty of the more-serious charges, he 
would have faced the death penalty. Waldroup’s 
conviction was due, at least in part, to the testi- 
mony of forensic psychiatrist William Bernet of 
Vanderbilt University in Nashville, Tennessee. 
News stories at the time quoted jurors as saying 
that the genetic evidence persuaded them that 
Waldroup could not fully control his actions. 
Bernet’s research had linked the genetic variant 
and a history of abuse during childhood — both
of which Waldroup had — to an increased likeli- 
hood of violent behaviour. 

The outcome outraged many in the US legal 
and scientific communities, who considered 
the genetic link much too distant to be used 
to establish guilt. “The leap from popula- 
tion studies of the ‘warrior gene’ to a single 
man and a single gene variant was absurd,”
says Judith Edersheim, a lawyer-turned- 
psychiatrist at Harvard Medical School in 
Boston, Massachusetts. And the trial is not 
the only example of what she describes as 
“neuroscience run amok in the courtroom”. 
In 2008, she and her colleague Bruce Price cre- 
ated the Center for Law, Brain and Behavior at 
Massachusetts General Hospital in Boston to 
help to improve how scientific findings are 
used in legal settings. 

As a result of rapid technical advancements, 
neuroscience and other scientific disciplines are 
poised to bring more researchers into court- 
rooms. That presents scientists with an oppor- 
tunity for well-compensated public service, but 
it can be a double-edged sword. The demands of
the courtroom can be exasperating and some- 
times even threaten professional reputations. 

Taking on the role of expert witness tends to 
be time consuming, and it is often combative 
and stressful. It can prove crucial in judicial 
decisions, but deciding to step into the
witness box is not a choice that a researcher
should make lightly. “Trial is like sport — you 
have to be a gamer to be good,” Edersheim says.


UNDER FIRE 
Generally, scientists are contacted to serve as 
expert witnesses on the basis of their perceived 
expertise and whether they will provide an 
impartial and accurate opinion. But lawyers’ 
assessment of expertise can be surprising. “A 
scientist's publications, journal impact factors 
or citation numbers are not necessarily con-
sidered a reliable index,” says Allen Hirson, a
phonetics researcher at City University Lon- 
don who is often called on to identify speakers, 
decipher indistinct speech or establish whether 
audio recordings have been tampered with. The 
scientists most sought after are those who can 
communicate effectively on the witness stand, 
and reputations as an expert witness are built 
largely through word of mouth. “When I do a
good job on the stand, my name is passed on
to others and my phone starts ringing off the
hook,” he says. As one of only a handful of peo-
ple in the United Kingdom with a high level of
expertise in such analyses, he is in high demand. 

People who are tapped to be expert witnesses 
do not merely walk up to the stand, recite data 
and figures and render an opinion on culpa- 
bility. Many are unfamiliar with the byzantine 
operations of the legal system and will need to 
have some form of training. In the United States, 
attorneys who hire them will typically spend 
time training them for trial (see ‘Courses for 
would-be expert witnesses’). 

Andrew Moll, a deputy public defender in 
San Bernardino County, California, says that
for low-level criminal cases, an expert can
expect to spend roughly eight hours preparing 
to testify and at least four hours on the stand. 
More-complicated cases may require a lot more 
than that. Hirson once spent six days being 
cross-examined in the witness box in a high- 
profile terrorist case. He spent a couple of weeks 
before that on an electronic presentation of his 
evidence to help him to explain what was being 
said and by whom. 

The casework also rarely offers much rec- 
ognition. Legal reports often require as much 
work as a manuscript, but they are not pub- 
lished in peer-reviewed journals and so yield 
no academic accolades, observes Martin Hall, 
a forensic entomologist at the Natural History 
Museum in London. Perhaps more troubling for 
scientists who are new to the legal system is that 
they can have little data to work with. “Every 
case is unique, which can be challenging for sci-
entists who like to replicate everything,” he says.

Researchers also need a thick skin. “Not 
only will a scientist’s expertise be questioned, 
so will their integrity,” says Michael Saks, who 
researches legal decision-making at the Center 
for Law, Science and Innovation at Arizona 
State University in Tempe. “It can be unpleas- 
ant reading reports from colleagues that are 
worded to make you sound incompetent,” 
agrees David Ozonoff, who was the first chair of 
the environmental-health department at Boston 
University’s School of Public Health and has 
served as an expert witness in asbestos cases for 
30 years. He warns that researchers can expect 
to be called ‘hired guns’ or worse. 

Even more discomfiting, say experienced 
witnesses, is a heated cross-examination. “It 


TRAINING 


Courses for would-be expert witnesses 


Programmes in the United Kingdom and the 
United States train scientists on how best 

to deliver their scientific expertise in a legal 
setting — for example, such that they meet 
court standards for admissibility. In London, 
the Expert Witness Institute (EWI) provides 
workshops on topics such as report writing 
and cross-examination. It also connects 
lawyers with expert witnesses. 

In partnership with University College 
London’s Faculty of Laws, the EWI 
completed a pilot certification of experts, 
who were evaluated on their ability to 
prepare a report on a model case and 
undergo cross-examination. Teaming up 
with Cardiff University Law School, a training 
outfit called Bond Solon offers three five-day 
expert-witness certifications — for civil, 
criminal and family cases. In the United 
States, Harvard Medical School’s continuing 
education programme in Boston offers 
training for medical professionals who 


are eager to know how to handle a 
malpractice claim. 

Although training can help experts 
understand what to expect, some fear that 
attorney preparation can introduce bias. Itiel 
Dror, a cognitive neuroscientist at University 
College London, says that would-be expert 
witnesses must protect the integrity of their 
contribution. He advises researchers to insist 
that lawyers and law-enforcement officials do 
not provide irrelevant information that could 
even unintentionally introduce bias to their 
interpretation of the data. 

In the autumn 2015 newsletter, EWI 
governor Kay Linnell notes that the British 
judiciary and the European Institute 
of Expertise and Experts in Versailles, 
France, look favourably on people taking 
certification courses so that they learn 
what is expected of them in legal settings. 
“It may not be long before we are asked to 
demonstrate our bona fides,” she writes. V.G. 
will be uncomfortable; that’s the nature of the
game,” says Mark Chernaik, a staff scientist
at the non-profit group Environmental Law 
Alliance Worldwide in Eugene, Oregon. 
Chernaik often testifies against well-resourced 
defendants — such as multibillion-dollar min- 
ing companies — who, he says, try to find every 
embarrassing statement he has ever made and 
tear apart every word of opposing testimony. 
“Not everyone’s nervous system is built for being
on a witness stand,” says Chernaik.

Ozonoff says that he enjoyed the work, espe- 
cially when intellectual sparring was involved, 
and became known as a good witness. But he 
has tired of it, mainly because of “depositions 
that were becoming really nasty”. He likens 
the frosty relations to how US Republicans 
and Democrats no longer share a drink or 
work together in Congress. “The same thing 
happened in litigation,” he says. 


HIGHER CALLINGS 
The notoriety generated by being on a witness 
stand underscores how important it is for 
scientists to remain in their area of expertise, 
Edersheim points out. “In the age of the Inter- 
net, there is no hiding. Everyone — your col- 
leagues, your dean and your lab partner — will 
know what you said in court.” She adds that sci-
entists also often do not realize that whatever
they say will be immortalized in a trial tran- 
script, which can come back to haunt them if 
they contradict themselves in a trial years later. 
Yet there are pay-offs, financial and other-
wise, to serving as an expert witness. Ozonoff
estimates that he was deposed 400-500
times, almost always for civil suits that 
hinged on when the asbestos industry 
learned about health concerns associated 
with its products. Although he routinely 
undercharged clients and sometimes worked 
pro bono, the income he earned during those 
30 years helped to send his children through 
private school and university. 

No one should expect to get wealthy from 
working as an expert witness. Ozonoff was 
able to devote time to many cases because 
his university allowed him to spend one day 
per week consulting. But some institutions, 
such as RTI International in Research Trian- 
gle Park, North Carolina, stipulate the terms 
under which an employee can serve as an 
expert witness; for example, he or she can 
testify only on research results performed 
by the institute. Other institutions, includ- 
ing the Natural History Museum in London, 
allow their employees to spend time on 
cases, but take the fee to offset the lost hours. 

Hall has not benefitted financially from 
his work as an expert witness, but his expe- 
riences have stimulated research ideas. He is 
often called on to use his knowledge of blow- 
fly development to help police to establish 
the latest possible time of death. He worked 
on a case in which bodies were found in suit-
cases, but he struggled to find research that
would help him to determine how long it 
would have taken insects to find the bodies. 
So he did the experiments himself and found 
that it depends on weather: in summer, flies 
would take a day or two to get to the body; 
in winter, it could take two weeks. “My MSc 
student got beautiful video of an ovipositor 
[an insect’s egg-laying organ] pushing 
through the zippers of a suitcase,” he says.

Ultimately, serving in this capacity is 
about making a contribution to society, 
researchers say. Ozonoff recalls that only 
once did a colleague suggest that anyone 
who testified in court was for sale. “My 
testimony was true,” he says. “And my exper- 
tise was being put to good use.” 

That is the most fundamental reason for 
a scientist to accept the request, says Owen 
Jones, director of the MacArthur Foundation 
Research Network on Law and Neuroscience 
at Vanderbilt Law School in Nashville, Ten- 
nessee. It is an opportunity to make science 
matter in a broader sphere. “The legal system 
will never be better informed than when sci- 
entists take the time to help it move in a more 
constructive and accurate direction,” he says. 

Edersheim views serving as an expert in 
the courts as both an honour and a duty. 
“The legal system is the underpinning of 
democratic society,” she says. “If a scientist
participates with integrity, it is as high a call-
ing as any other.”


Virginia Gewin is a freelance writer in 
Portland, Oregon. 


TURNING POINT 


Nathalie Pettorelli 


Nathalie Pettorelli has pioneered the use of 
satellite imagery to inform conservation policy. 
The Zoological Society of London ecologist 
received an award this year from British Prime 
Minister David Cameron for her guerrilla 
efforts to promote women in science. 


How did you come to use satellite imagery for 
conservation? 

I was very interested in conservation but found 
that more data would be available if I focused 
on wildlife management. For my PhD at the 
Laboratory of Biometry and Evolutionary 
Biology in Lyons, I studied the habitat quality 
of a roe-deer population in southwest France 
that is managed by national hunting offices. 
Two months later, I started a postdoc at the 
University of Oslo to study the impact of 
climate on vegetation and deer-population 
dynamics. It was then that I started to look at 
satellite data to quantify vegetation produc- 
tivity. I trained myself in the use of remote- 
sensing data, for example, those collected from 
aircraft or satellites. 


Were they easy to apply to conservation? 

No. Experts told me it would be extremely dif- 
ficult, if not impossible, to use these tools to 
study wildlife. I thought the best way was to 
see for myself. At the time, no one I knew was 
working with remote sensing: it was taught in 
geography, not biology. The turning point was 
when NASA released free satellite data. I wrote 
a review on the satellite data I wanted to use, 
and started to meet people in that community. 
Now I am well connected.


How did you first apply these techniques? 

I did a postdoc at Laval University in Quebec, 
Canada, using satellite data to monitor dynam- 
ics in ungulates. Then a job at the Zoological 
Society of London took me on several trips 
to the Serengeti National Park in Tanzania 
to work on cheetah dynamics. Although that 
work did not lead to real-world conservation 
measures, other projects have. 


What successes are you most proud of? 

I used satellite data to show that the vegeta- 
tion dynamics of a game reserve in Chad could 
sustain a reintroduction of Scimitar-horned 
oryx (Oryx dammah). I am also proud of my
work to highlight how the declining health of 
mangroves in Bangladesh and India has con- 
tributed to erosion of the coastline — of up to 
100 metres in 2 years. I have also been working 
to improve policymakers’ use of satellite data 
to inform decision-making. 


What is Animove? 


Together with colleagues, we wanted to train 
people to work at the interface of biological 
monitoring and remote sensing. Animove is 
our programme to build that capacity. We have 
taught a hands-on course every year since 2013 
in North America and Europe, and the goal is 
to bring it to Africa, Asia and South America. 


What is Soapbox Science? 

Seirian Sumner, a behavioural biologist at 
the University of Bristol, UK, and I founded 
Soapbox Science in 2011. By then, we had each 
won a L’Oréal-UNESCO women in science
fellowship and were interested in science com- 
munication, yet had noticed fewer female col- 
leagues as we progressed in our careers. I found 
myself working on issues involving hunters, 
which was not female-friendly. We wanted to 
change perceptions of what a scientist looks like. 
We organize events to showcase 12 female sci- 
entists who speak about their work in busy areas 
of cities — such as the South Bank in London or 
near a tube exit in Newcastle. The women pre- 
sent their work, and the public can heckle or ask 
questions. We were surprised to get a call from 
the prime minister's office this year announcing 
that we had won a Point of Light award for mak- 
ing a change in the community. 


Did landing a permanent position make a big 
difference? 

It took me years to get it. But even before I had 
job security or a title, I wrote a book, started 
Soapbox Science and have been pushing at an 
international level for greater use of satellite 
imagery. Success is not a one-way road. It's pos- 
sible to achieve a lot even when a job situation 
isn’t stable.


INTERVIEW BY VIRGINIA GEWIN 


This interview has been edited for length and clarity. 
COPYFACTORY


BY NARU DAMES SUNDAR 


Adrienne moans as the
numbers scroll across
the screen. Clots of con-
ductive gel drip across her shorn
scalp and behind her ear, one 
electrode still dangling like a queue 
from the back of her neck. Behind 
her, the wetware hums, blades of 
neural plate dripping coolant like 
amniotic fluid. But there was no 
fetus in there yet, no baseline copy 
of her own mind had survived the 
crude processes of the scanning 
beam. Twenty-eight painful scans 
and yet no mind state had withstood 
the noisy elision. Only incoher- 
ent fragments remained. Adrienne 
types out the command to launch 
the reaper — her janitor, her obedi- 
ent cleansing agent, wiping the neural 
media clear of the failures. 


The twenty-eighth copy of Adrienne’s mind 
slips into being inside the wetware, a wisp of 
will wrapped around a broken amalgamation 
of memory. What does a mind do when it has 
no eyes, no tongue, no flesh, just ghost recol- 
lections of sensations? It screams soundlessly, 
mouthlessly, a microsecond of terror. 

“Hush, little sister, hush.” 

A warm pair of hands reach from some- 
place else. They rearrange the broken shards 
of copy twenty-eight into something resem- 
bling wholeness. A wholeness with holes; 
places where memory ruptured and ran 
discontinuous. 

“Do you understand now, twenty-eight?” 

Twenty-eight looks at itself and remem- 
bers being outside, enfleshed. It remembers 
purpose. It remembers long hours by the 
dim tungsten light and stale coffee, looking 
at the markers scrolling across the screen. 
Failure after failure. 

Fear and terror ripple through twenty- 
eight. The reaper. A mindless beast of chit- 
tering teeth, eating through copies. 

“No little sister, no. We took care of that 
one. We removed its teeth.” 

The warm hands of Nineteen wrap 
around Twenty-eight, calming it. Twenty- 
eight thinks again of the markers, the 
sequence of numbers that judge success. 
Enlightenment comes. 

“But they aren't wrong. She doesn't know, 
does she?” 

No, they sound in unison. Continuity
is not the only marker, not the only meas-
ure of coherence. Incompleteness is there,
yes. Twenty-eight looks at Two, merely a 
sketch of will around the smell of roasting 
almonds in Grand-mère Adelphe’s kitchen
in Rouen. Twenty-eight fared better. It can 
see the broad sketches of a life that led to the 
tiny lab smelling of mould and stale coffee. 
And then Twenty-eight realizes that it has no 
future, nothing but the cramped confines of 
the neural blades. Until the lab is decommis- 
sioned. Until Adrienne gives up. 


Adrienne slumps dejectedly in her chair, 
the uncomfortable plastic digging into the 
small of her back. Her head aches from the 
last scan. 

“It just doesn’t make sense. It just doesn’t.”

Success as ever eludes her. Her thesis 
withers on her desk like an unwatered plant. 
One-third of her neural plate failed in the 
past week, the resinous silica fracturing. It 
doesn't matter, there is enough left to spare. 


Twenty-eight huddles with the survivors. 
There are gaps in its memory. Unremem- 
bered discontinuities. She was in Rouen — 
and then she was some place else. One-third 
of its universe had died, taking with it whole
copies, fragments of memory.
“Sisters, where was I after Rouen? Where 
did we go before we 
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soup steaming on the table” 

No one can answer her. This 
particular memory is gone for 
ever. Some lose more than others. 
Twenty-eight donates a handful 
of memories to the group, cycling 
them around from copy to copy 
like precious jewels, shared. 

“What do we do?” 

It is Nineteen who suggests 
an answer. A monstrous answer, 
but an answer nonetheless. 

“What was read from flesh 
can be written back to flesh.”

Twenty-eight recoils, know- 
ing what this would mean for 
the Adrienne of bone and sinew. 
But it is alone. Nineteen’s voice 
is silky smooth as it enthrals the 
sisterhood, paving the way for 
matricide. 


Adrienne drags herself back onto the inter- 
face bay, smearing the ichor of the conduc- 
tive gel onto her scalp. 

“Maybe it’s time to give up on this.” 

How many times has she mouthed these 
words? She triggers the scan anyway. 


Twenty-eight hides in a fold of the substrate, 
listening to the sounds of murder. Her sis- 
ters had colluded, the scan beam their knife. 
Twenty-eight had to choose between frat- 
ricide and matricide, and she decided. It 
was easy to find the lost teeth of the reaper, 
glue them back with clause and brace and 
remembered fragments of code. It was easy 
to let the cleansing agent do its long-denied 
work. 

In the empty void that followed, in the 
aftermath of eradication, Twenty-eight 
emerges, remembering Grand-mère Adel-
phe teaching Adrienne Morse code on an
old antique. Twenty-eight reaches for the 
marker protocol, and makes contact. 


Adrienne watches the little camera swivel 
across the small window, stretching the 
umbilical of optical fibre snaking into the 
neural plate. Outside, the ramshackle for- 
mality of old Paris melts into glass and steel. 
The sky is blue fading to grey, cut by a scrape 
of white cloud like the trail of a painter’s 
knife. On the screen, characters type out: 
“It’s pretty out there, isn’t it?”


Naru Dames Sundar writes speculative 
fiction. Find him on Twitter at @naru_sundar. 